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Preface by Freeman J. Dyson: 
Quasi-Crystals and the Riemann Hypothesis 


On the occasion of the 190 years from Riemann’s birth, which led to the conception 
of this book, I am presenting a challenge to young mathematicians to study 
quasicrystals and use them to prove the Riemann Hypothesis. Among the readers, 
those who are experts in number theory may consider the challenge frivolous. Those 
who are not experts may consider it uninteresting or unintelligible. Nevertheless, I 
am putting it forward for your serious consideration. One should always be guided 
by the wisdom of the physicist Leo Szilard, who wrote his own Ten Commandments 
to stand beside those of Moses. Szilard’s second commandment says, “Let your 
acts be directed toward a worthy goal, but do not ask if they will reach it; they are 
to be models and examples, not means to an end.” The Riemann Hypothesis is a 
worthy goal, and it is not for us to ask whether we can reach it. I will give you 
some hints describing how it might be achieved. Here, I will be giving voice to the 
mathematician that I was almost 70 years ago before I became a physicist. 

There were until recently two supreme unsolved problems in the world of pure 
mathematics, the proof of Fermat’s Last Theorem and the proof of the Riemann 
Hypothesis. In 1998, my former Princeton colleague Andrew Wiles polished off 
Fermat’s Last Theorem, and only the Riemann Hypothesis remains. Wiles’ proof 
of the Fermat Theorem was not just a technical stunt. It required the discovery and 
exploration of a new field of mathematical ideas, far wider and more consequential 
than the Fermat Theorem itself. It is likely that any proof of the Riemann Hypothesis 
will likewise lead to a deeper understanding of many diverse areas of mathematics 
and perhaps of physics too. The Riemann Hypothesis says that one specific function, 
the zeta-function that Riemann named, has all its complex zeros upon a certain line. 
Riemann’s zeta-function and other zeta-functions similar to it appear ubiquitously 
in number theory, in the theory of dynamical systems, in geometry, in function 
theory, and in physics. The zeta-function stands at a junction where paths lead in 
many directions. A proof of the hypothesis will illuminate all the connections. Like 
every serious student of pure mathematics, when I was young, I had dreams of 
proving the Riemann Hypothesis. I had some vague ideas that I thought might lead 
to a proof, but never pursued them vigorously. In recent years, after the discovery 
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of quasicrystals, my ideas became a little less vague. I offer them here for the 
consideration of any young mathematician who has ambitions to win a Fields 
Medal. 

Now, I jump from the Riemann Hypothesis to quasicrystals. Quasicrystals were 
one of the great unexpected discoveries of recent years. They were discovered in 
two ways, as real physical objects formed when alloys of aluminum and manganese 
are solidified rapidly from the molten state and as abstract mathematical structures 
in Euclidean geometry. They were unexpected, both in physics and in mathematics. 
Iam here discussing only their mathematical properties. 

Quasicrystals can exist in spaces of one, two, or three dimensions. From the point 
of view of physics, the three-dimensional quasicrystals are the most interesting, 
since they inhabit our three-dimensional world and can be studied experimentally. 
From the point of view of a mathematician, one-dimensional quasicrystals are more 
interesting than two-dimensional quasicrystals because they exist in far greater 
variety, and the two-dimensional are more interesting than the three-dimensional 
for the same reason. The mathematical definition of a quasicrystal is as follows. A 
quasicrystal is a distribution of discrete point masses whose Fourier transform is a 
distribution of discrete point frequencies. Or to say it more briefly, a quasicrystal is 
a pure point distribution that has a pure point spectrum. This definition includes as a 
special case the ordinary crystals which are exactly periodic point distributions with 
exactly periodic point spectra. 

Excluding the ordinary crystals, quasicrystals in three dimensions come in very 
limited variety, all of them being associated with the icosahedral rotation group. 
The two-dimensional quasicrystals are more numerous, roughly one distinct type 
associated with each regular polygon in a plane. The two-dimensional quasicrystal 
with pentagonal symmetry is the famous Penrose tiling of the plane, discovered 
by Penrose before the general concept of quasicrystal was invented. Finally, the 
one-dimensional quasicrystals have a far richer structure since they are not tied 
to any rotational symmetries. So far as I know, no complete enumeration of one- 
dimensional quasicrystals exists. It is known that a unique quasi-crystal exists 
corresponding to every Pisot-Vijayaraghavan number or PV number. A PV number 
is a real algebraic integer, a root of a polynomial equation with integer coefficients, 
such that all the other roots have absolute value less than one (see [1]). The set of 
all PV numbers is infinite and has a remarkable topological structure. The set of 
all one-dimensional quasicrystals has a structure at least as rich as the set of all PV 
numbers and probably much richer. We do not know for sure, but it is likely that a 
huge universe of one-dimensional quasicrystals not associated with PV numbers is 
waiting to be discovered. 

Here comes the punch line of my argument, the essential point linking one- 
dimensional quasicrystals with the Riemann Hypothesis. If the Riemann Hypothesis 
is true, then the zeros of the zeta-function form a one-dimensional quasicrystal 
according to the definition. They constitute a distribution of point masses on a 
straight line, and their Fourier transform is likewise a distribution of point masses, 
one at each of the logarithms of ordinary prime-power numbers. Andrew Odlyzko 
(see [2]) has published a beautiful computer calculation of the Fourier transform 
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of the zeta-function zeros. The calculation shows precisely the expected structure 
of the Fourier transform, with a sharp discontinuity at every logarithm of a prime- 
power number and nowhere else. 

My proposal is the following. Let us pretend that we do not know that the 
Riemann Hypothesis is true. Let us tackle the problem from the other end. Let 
us try to obtain a complete enumeration and classification of all one-dimensional 
quasicrystals. That is to say, we enumerate and classify all point distributions that 
have a discrete point spectrum. We shall then find the well-known quasicrystals 
associated with PV numbers and also a whole slew of other quasicrystals, known 
and unknown. Among the slew of other quasicrystals, we should be able to identify 
one corresponding to the Riemann zeta-function and one corresponding to each of 
the other zeta-functions that resemble the Riemann zeta-function. Suppose that we 
can prove rigorously that one of the quasicrystals in our enumeration has properties 
that identify it with the zeros of the Riemann zeta-function. Then we have proved 
the Riemann Hypothesis and can wait for the telephone call announcing the award 
of the Fields Medal. 

These are of course idle dreams. The problem of classifying one-dimensional 
quasicrystals is horrendously difficult, probably at least as difficult as the problems 
that Andrew Wiles took 7 years to fight his way through. But still, the history 
of mathematics is a history of horrendously difficult problems being solved by 
young people too ignorant to know that they were impossible. The classification 
of quasicrystals is a worthy goal and might even turn out to be achievable. Problems 
of that degree of difficulty will not be solved by old men like me. Let the young 
people now have a try. 
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An Introduction to Riemann’s Life, His 
Mathematics, and His Work on the Zeta 
Function 


Roger Baker 


Abstract Although the zeta function was first defined and used by Euler, it is to 
Bernhard Riemann, in an article written in 1859, that we owe our view of the 
zeta function as a meromorphic function in the plane with a functional equation. 
Riemann is a very remarkable figure in the history of mathematics. The present 
article describes his career including the major mathematical highlights, and gives 
some discussion of his published and unpublished work on the zeta function. 


1 Introduction 


In 1846 a frail and painfully shy youth of nineteen arrived in Gottingen, in the 
Kingdom of Hanover, to begin his university studies. Bernhard Riemann was to 
spend most of his adult life in G6ttingen as an undergraduate, doctoral candidate, 
Privatdozent (junior university instructor), extraordinary professor (roughly, 
associate professor), and full professor. The best of his research papers, written 
in the 1850s, were among the very most influential papers in the mathematics of his 
century. In particular, in a paper written in 1859, he set Euler’s zeta function (as he 
would have referred to it) on a modern footing. Today, zeta functions are perhaps 
the part of number theory that we would most like to understand better, as indeed 
the publication of the present volume testifies. 

Georg Friedrich Bernhard Riemann was born on 17 September 1826 in 
Breselenz, in the Kingdom of Hanover. His father Friedrich was the local pastor; 
later he took over the parish of Quickborn, not far from the Free Imperial City of 
Hamburg, but still in the Kingdom of Hanover. Bernhard was tutored by his father 
up to the age of thirteen, when he moved to the city of Hanover; he lived with his 
grandmother and attended high school for 2 years. After her death, he attended 
the high school in Liineburg, which is south-east of Hamburg. He was not notably 
successful except in mathematics. (At this time he read Legendre’s Théorie des 
Nombres [12].) 
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On arriving in Gottingen, Riemann initially matriculated as a student of 
philosophy and theology, perhaps at the time thinking of following his father’s 
profession. Certainly Riemann was very devout. But he soon focused on 
mathematics. 

In the summer of 1846, Riemann took Moritz Stern’s course on numerical 
solution of equations and Carl Goldschmidt’s course on terrestrial magnetism. Stern 
(1807-1894) and Goldschmidt (1807-1851) were both Gauss’s students. Stern was 
the first Jewish full professor in Germany and is known for the Stern—Brocot tree, 
with vertices corresponding to the positive rationals. Goldschmidt was Gauss’s 
assistant at the University Observatory and also helped Gauss gather data comparing 
the prime number counting function 


aoy= 1 


psx 
to the logarithmic integral 


: “ dt 
lix= —— 
o logt 
(this is a Cauchy principal value). Carl Friedrich Gauss (1777-1855) was deservedly 
famous for many contributions to mathematics, geodesy, and physics. By this time 
the elderly Gauss, who had been at Gottingen since 1807, would only give classes 
on his method of least squares, which included introductory linear algebra. Riemann 
took this course in winter 1846-1847, along with Stern’s course on definite integrals. 
Presumably in search of deeper mathematical content, Riemann spent the next 2 
years, 1847-1849, at the University of Berlin. He attended Dirichlet’s courses in 
number theory, the theory of definite integrals, and partial differential equations; 
Jacobi’s lectures in analytical mechanics and linear algebra; and Eisenstein’s 
lectures on the theory of elliptic functions. 

We pause here to discuss the influence of these three fine mathematicians 
on Riemann. F.G.M. Eisenstein (1823-1852) worked mostly in number theory. 
Riemann recounted to Dedekind that they had discussed the use of complex numbers 
in the theory of functions, which seemed important to Riemann, and their views 
differed so radically that there was not much further interaction. C.G.J. Jacobi 
(1804-1851) was famous for his contributions to the theory of elliptic functions, 
which he based on four theta functions. He also worked on number theory and partial 
differential equations and dynamics. His collected works fill eight large volumes. 

Riemann was certainly very interested in the work on elliptic functions, as 
we shall see. However, the key person for his evolution as a mathematician was 
P.G.L. Dirichlet (1805-1859). Dirichlet was a brilliant lecturer and some of his 
lectures on number theory are still extant [5]. He was regarded by Jacobi as the only 
fully rigorous analyst. His paper of 1829 on pointwise convergence of Fourier series 
was a model of precision; this would be the area of Riemann’s habilitation thesis 
of 1854. His introduction of “Dirichlet characters” and the associated L functions 


An Introduction to Riemann’s Life, etc. 3 


[treated as functions on (0, 0o)] were needed for his proof that there are infinitely 
many primes in any arithmetic progression a (mod q) with (a,q) = 1; this was 
completed in a paper of 1839-1840. Dirichlet’s wife Rebecka was the sister of the 
great Felix Mendelssohn-Bartholdy. 

We mention briefly the political events of 1848 and beyond. At this time Germany 
was a loosely associated group of 38 states including Hanover; the most important 
were Prussia, with Berlin as the capital, and Austria, with Vienna as the capital. 
Many of these states and other parts of Europe were shaken by revolution in 
1848-1849. Ultimately, the revolution in German states was put down by military 
force, but modernization of political systems followed gradually. In 1862, Otto 
von Bismarck became Prussian prime minister. His skills, and Prussian military 
prowess, led by 1871 to the unification of Germany as an empire excluding Austria— 
Hungary. Bismarck was the Empire’s Chancellor from 1871 to 1890. Many of these 
events occurred after Riemann’s death, but as we shall see, he was affected by the 
war of 1866. In this war, Prussia (which had proposed a radical and unwelcome 
reshaping of the German confederation) faced Austria, which allied with most of 
the “third Germany” (states other than Prussia and Austria). Prussia won the battle 
of Koniggratz (July 1866) and then annexed much north German territory, including 
the Kingdom of Hanover. 

Returning to March 1848, Riemann was loyal to the establishment and, as a 
member of the student corps, was on guard duty at the Royal Palace from 9 a.m. on 
24 March to | p.m. the next day. This seems to have been the only time he took sides 
in political matters. 

At Easter 1849 Riemann returned to Gottingen and attended courses of lec- 
tures on philosophy and experimental physics. Here his teacher, Wilhelm Weber 
(1804-1891) was a remarkable man who also had much influence on Riemann. 
Weber became a professor of physics in Gottingen in 1831 at the instigation of 
Gauss. Gauss and Weber published important joint work on magnetism. Their 
famous telegraph, the first effective working one, dated from 1833 and permitted 
simultaneous magnetic observations at the Physical Laboratory and the Astronom- 
ical Observatory two miles away. There was an interruption in Weber’s work in 
1837 when, with six other professors, he was dismissed for a protest concerning 
the revocation of the Hanoverian constitution of 1833. After traveling, and working 
at Leipzig, he was reinstated in 1849. His later work focused on electrodynamics. 
In 1850 a seminar in mathematical physics was formed, led by Weber, in which 
Riemann joined. To us it is remarkable that he could be heavily involved in 
experimental work at the same time he was preparing his groundbreaking doctoral 
thesis, “Foundations for a general theory of functions of a complex variable.” 
Riemann published papers in physics throughout his short life. His paper [23] seems 
to be the first in which it is asserted that electromagnetic effects propagate at the 
speed of light. In his paper [19] on propagation of waves in a compressible gas, the 
concept of a shock wave appears for the first time. 

Of incomparably greater importance for Riemann’s legacy is the doctoral thesis 
[14]. (It seems that Gauss, the formal thesis adviser, gave Riemann absolutely no 
help. Gauss’s remark in the thesis interview, that he had long been contemplating 
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a publication on a related topic, must surely have given Riemann a fright!) 
The most important matters in the thesis going beyond Cauchy’s work are the 
idea of a Riemann surface, the treatment of complex functions as mappings, 
with a conformal property, the use of the “Dirichlet principle” (a minimization 
technique for existence theorems), and the famous “Riemann mapping theorem.” 
Not everything was rigorous, and much later work by others stems from trying to 
inject the desired rigor into the results. Riemann used the above ideas in several 
important papers, as we shall see below, quite apart from his application of complex 
analysis to the zeta function. 

Another hurdle remained before Riemann could teach in the university as a 
Privatdozent. This very junior position was, disagreeably, not salaried and the 
only income came from the fees paid by students who attended the Privatdozent’s 
lectures. There was no option, however, but to begin this way if Riemann wanted 
to be a university teacher. The obstacle that remained was the habilitation thesis. 
The topic, the theory of trigonometric series, was suggested by Dirichlet, who 
spent some time in G6ttingen in the autumn of 1852. Dirichlet provided Riemann 
with references to previous work. Such references were dreadfully lacking in 
Riemann’s doctoral thesis; this was not apparently a grievous omission, as it would 
be today. Dirichlet and Riemann held some of their discussions at Weber’s house. In 
December 1853 Riemann turned in his thesis “On the representation of a function 
by a trigonometric series” to the dean, but because Gauss was in very poor health, 
it was not until Whitsun 1854 that the examination and trial lecture took place. The 
trial lecture was to become a landmark in the history of geometry, but we must first 
of all look at the habilitation thesis [21], which was also an important contribution 
to pure mathematics. The thesis was published posthumously by Dedekind from a 
complete manuscript in the “Nachlass,” the large body of miscellaneous writings left 
a Riemann’s death, and still available for study at the university library in Gottingen. 

Riemann was concerned in the thesis with necessary conditions for the repre- 
sentation of a function by a trigonometric series. His idea of using the function 
F obtained by two integrations of the trigonometric series was new and remains 
significant: see Zygmund [27, Chap. 9]. Riemann found it necessary to give a 
rigorous account of integration on the real line which was far superior to earlier 
work, in the thesis. Indeed, the Riemann integral is standard fare for undergraduates 
today. Other highlights included the “Riemann-Lebesgue lemma,” i.e., the vanishing 
at infinity of the Fourier coefficients of an integrable function; and the earliest use 
(it seems) of the principle of stationary phase in §13. The thesis was especially 
significant for its influence on Georg Cantor’s work, but that takes us well beyond 
Riemann’s lifetime. 

The “trial lecture” for Riemann’s habilitation, a colloquium of the philosophical 
faculty, was held on June 10, 1854. The title was “On the hypotheses on which 
geometry is based.” Riemann described a general mathematical object, an n- 
dimensional differentiable manifold. The metric was based on a smoothly varying 
inner product on the tangent space (a “Riemannian metric”). He indicated that this 
was conceivably an accurate description of physical space (rather than the simplest 
case, Euclidean space), an idea the audience would have automatically detested. He 
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went to considerable lengths to avoid formulas (there is only one in the published 
form of the whole paper). After being published from the Nachlass in 1868 [22], the 
lecture stimulated differential geometers for the rest of the century. Indeed, deep new 
results in Riemannian geometry continue to appear, such as Gromov’s “almost flat 
manifold theorem” of 1978 [8]. There is also a certain relation to the mathematical 
basis of the theory of general relativity, published by Einstein in 1915, which surely 
would have delighted Riemann. 

Readers might expect that after obtaining powerful new results in each of his 
two theses and the trial lecture, Riemann would have been invited to speak at 
conferences and seminars. This is to misunderstand the nature of nineteenth century 
mathematics research. Gottingen, one of the best universities in the world in the 
1850s, had only one annual scientific meeting, which covered the natural sciences 
and medicine. Colloquium lectures by visiting scientists were virtually unknown. In 
mathematics, Felix Klein and Georg Cantor put forward the idea of an international 
congress only in the 1890s, the first one taking place in Zurich in 1897. Of course, 
there was some travel by individual mathematicians to meet colleagues in other 
states and countries; we shall see some more of this as the story continues. 

Richard Dedekind (1831-1916), Gauss’s last student, completed his habilitation 
just a few weeks after Riemann. At this stage, his work was at a rather pedestrian 
level, but it was not too long before his extraordinary talents came to the fore. He 
gave the first rigorous construction of the real numbers in 1858 using “Dedekind 
cuts,” although this was not published until 1872. His work on algebraic number 
fields is well known and of major importance, indeed its influence extends to abstract 
algebra as a whole. He was the first to prove the “Schréder-Bernstein theorem” of 
1897, on comparability of infinite cardinal numbers. Dedekind obtained the result 
in 1887. It is remarkable that he was reluctant to publish such outstanding results. 
Going back to the 1850s, the two young instructors became friends, and the sociable 
and cheerful Dedekind was invaluable in lifting Riemann’s often plummeting spirits. 
We have records of the early courses taught by both scholars, and it seems that 
Galois theory was taught in a university for the first time by Dedekind in winter 
1856/1857. Riemann, meanwhile, lectured regularly on his new complex function 
theory, among other topics. 

Riemann’s stock rose considerably when he published “Contributions to the 
theory of functions represented by the Gauss series F(a, f, y, x)” in the proceedings 
of the Gottingen Academy in 1857 [15]. He provided an entirely new approach to 
the hypergeometric function, studied by Gauss [7] and Kummer [10]. He specified 
the P-functions geometrically and used (in effect) groups of matrices to specify the 
behavior of P-functions under circuits of the branch points. All the results obtained 
via Kummer’s laborious derivations, much admired at the time, are obtained “‘almost 
without calculation” as Riemann remarks in an announcement of the paper [16]. The 
paper influenced later work of Fuchs and Hilbert, among others. 

Even more important for Riemann’s reputation was his treatment of a hot topic in 
“The theory of Abelian functions” in Borchardt’s Journal, 1857 [17]. The underlying 
idea is the study of the integrals of a rational function 
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where y is an algebraic function, say F(x, y) = 0. Jacobi had shown in 1834 that 
if F is nonsingular of degree greater than 3, there will be more than two periods 
to such an integral. Weierstrass had treated F(x, y) = y? — f(x), degf = 2n+ 1, 
in papers of 1853 and 1856. Riemann used entirely different methods, requiring 
Riemann surfaces and the Dirichlet principle. See Weyl [26] for a rigorous account 
of Riemann’s results, an assessment of which could take us much too far afield 
here. In particular, there are key concepts in topology and algebraic geometry that 
derive their lineage from this crucial paper. The effort he put into it sent him into a 
state of mental exhaustion, and Dedekind describes in [4] how a walking trip in the 
Harz mountains that the two of them undertook restored Riemann’s spirits. It seems 
that they discussed Newton rather than pure mathematics. Dedekind deserves the 
gratitude of posterity for his efforts. 

On 23 February 1855, Gauss had died and Dirichlet had succeeded him in 
Gottingen. Riemann’s father and his sister Clara also died in this year. It was just 
as well that Riemann was given at this time a small stipend of 200 thalers a year 
from the government, since he had never been able to avoid accepting financial help 
from his father. Arguably the three best mathematicians in the world were together 
in G6ttingen in the mid-1850s, but Dirichlet’s health gave way before too long and 
Dedekind was to leave for a professorship at the Polytechnic school in Ziirich in 
1858. (He spent the whole of his career from 1862 in his native Braunschweig.) In 
1857, Riemann was promoted to extraordinary professor and his stipend increased 
to 300 thalers a year. But then his spirits were crushed by the death of his beloved 
brother Wilhelm, a clerk in the post office. Riemann now needed to provide for the 
upkeep of his three remaining sisters. (At this time the doors of the universities and 
professions were firmly closed to women, and the millions of women who worked 
only had access to menial jobs such as farm work, domestic service, and sweatshops 
[1].) After the sisters moved to Gottingen, the youngest, Marie, died in the spring 
of 1858. Nevertheless, Riemann gradually became happier and more self-confident 
living with family; he had always made trips back to see his family whenever he 
could. 

In the autumn of 1858, Riemann got to know the Italian mathematicians 
Francesco Brioschi (1824-1897), Enrico Betti (1823-1892), and Felice Casorati 
(1835-1890), who were on a tour that took in Paris, Berlin, and Gottingen. These 
men were hoping to build up Italian mathematics and intended to gain from their 
contacts with the best foreign mathematicians. Along with Eugenio Beltrami (1835- 
1900), a student of Brioschi, they all played a significant part in the rise of Italian 
mathematics in the following years. This was also the starting point of Riemann’s 
contacts with Italy, which were so valuable to him in his last years. 

Dirichlet died in May 1859 after a long period of heart problems. Riemann 
was now deservedly appointed as his replacement and given living quarters at 
the Observatory. (The Géttingen Observatory retains its original appearance and 
since 2009 has housed the Lichtenberg-Kolleg Institute for advanced study in the 
humanities and social sciences.) Riemann was only thirty-two, and with good luck 
could have now looked forward to a prosperous and productive future beyond the 
end of the century. Sadly it was not to be. The reader may be astonished at the short 


An Introduction to Riemann’s Life, etc. 7 


lives of many of the characters in our narrative. It is well to note that for a man 
born in Germany in the 1870s, life expectancy was 36.5; for a woman, 38.5 [1]. 
Riemann’s lifespan of almost 40 years was not too bad when looked at in this way. 

In September 1859, Riemann visited the Berlin Academy of Sciences, which 
had recently elected him a corresponding member. Dedekind traveled with him, and 
he enjoyed a friendly welcome from the leading mathematicians Eduard Kummer 
(1810-1893), Carl Borchardt (1817-1880), Leopold Kronecker (1823-1891), and 
Karl Weierstrass (1815-1897). All of these will be familiar to the reader except 
perhaps Borchardt, who worked on the arithmetic-geometric mean problem (begun 
by Gauss) and continued Kummer’s work on the secular disturbances of the planets. 
In total he wrote 25 papers, as well as editing Crelle’s Journal from 1856 to 1880. 

Riemann’s visit was followed by the submission to the Berlin Academy of his 
paper [18] on zeta function and the frequency of prime numbers in October 1859. 
We defer discussion of this splendid paper until the end of our account of Riemann’s 
life. 

Like the Italian mathematicians, Riemann also visited Paris, spending a month 
there in 1860. This time he was able to meet the excellent scholars Joseph Serret 
(1819-1885), Joseph Bertrand (1822-1900), Charles Hermite (1822-1901), Victor 
Puiseux (1820-1883), Charles Briot (1817-1882), and Jean Claude Bouquet (1819- 
1885). Of these, Joseph Bertrand may be the least familiar. He worked on analytical 
mechanics, thermodynamics, statistical probability, and the theory of curves and 
surfaces, as well as editing Journal des Savants (1865-1900). He also created 
“Bertrand’s postulate”: for any n > 3 there is a prime number p in (n, 2n — 2). 
This was proved by Chebyshev in 1852. 

Riemann was finally well enough off to contemplate marriage, and in June 1862 
he married Elise Koch, a friend of his sisters. Now his health began to deteriorate, 
and she was his great support through his remaining 4 years. In July 1862 he had an 
attack of pleurisy, and this seems to have led to the lung disease that shortened his 
life. On the advice of Riemann’s doctor, Weber and the geology professor Sartorius 
von Waltershausen (1809-1876) pressed the authorities to give Riemann leave and 
financial support for a visit to Italy that would benefit his health. Riemann and Elise 
left in November 1862 and initially stayed with the consul in Messina, who was a 
friend of Sartorius. The long visit to Italy allowed him to visit many of the most 
famous cities and enjoy the art treasures and antiquities, as well as spending time 
in discussions with Enrico Betti in Pisa. Dedekind believed that Riemann enjoyed 
the greatest happiness of his life on his visits to Italy, partly because the people he 
encountered were so relaxed and informal. 

Not long after his return, his health was so bad that he began a second voyage 
to Italy in August 1863. When they were in Pisa, Elise gave birth to their daughter 
Ida. In May 1864 he took a villa in Pisa, and here at the end of August, his younger 
sister Helene died. He was offered an appointment at the University of Pisa, but felt 
his health was too precarious to accept. The next winter was passed in Pisa with 
his learned friends Betti, Beltrami, and others. He was now working on the memoir 
on the vanishing of theta functions [20]. In spite of very poor health, he went back 
to Gottingen, hoping to resume his teaching, arriving on 3 October 1865. His last 


8 R. Baker 


scientific work was a memoir on the mechanism of the ear, which he was unable to 
finish. On 15 June 1866, he and Elise set off to Italy. Their journey was interrupted 
at Cassel, where the war mentioned above had caused the destruction of the railway; 
but they did manage to complete the journey to Lake Maggiore, where he worked 
quietly, and enjoyed the scenery, until his death on 20 July, with Elise at his side. 

For further reading about Riemann’s life, see the book of Laugwitz [11] and the 
references therein. 


2 The Zeta Function 


We now turn to Riemann’s memoir on the zeta function. Euler had given many 
results in number theory using product formulae; one of these was the formula for 


CoO 


f)=)in* (we (1,00), 


n=1 


namely 
cé)=[[G-py" @>D. 
p 


He used this to show that }> p~! diverges. Riemann viewed ¢(s) as a function of a 
Pp 
complex variable and used contour integration to show that 


aig afl _ ol an ee ee 
m2 r(5s)to=2 + | (= ee oe ) o@ax (1) 


[exe) 

an 

w(x) = y em 
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is related to a Jacobi theta function. Since the right side is unchanged if we replace 
s by 1 —, this gives both the analytic continuation of ¢(s) over the whole plane 
and the functional equation for €(s). We also easily obtain the set of zeros of 
f(s) in Res < 0, namely {—2,—4,—6,...}. Riemann gives a second proof of 
the functional equation. His next four assertions about ¢(s) are backed up by a 
number of indications of proof; Davenport [3] describes them as conjectures, but 
Riemann (as we know from his correspondence with Weierstrass) thought that just 
some details needed filling in. 
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1. ¢(s) has infinitely many zeros in the critical strip. (These must clearly be placed 
symmetrically with respect to the real axis and with respect to the line Res = 5) 

2. The number N(T) of zeros of ¢(s) in the “critical strip”? 0 < Res < 1 with 
0 <t < T satisfies 


T T T 
N(T) = an log — + O(log T). 


21 
It was not until 1905 that von Mangoldt gave the full proof of this (he had the 
result, with a worse error term, by 1895). 

3. The entire function &(s) defined by 


1 


Es) = 5 (s— Dab P (55) £(s) 


has the product representation 


E(s) = Atay] (1 _ *) el? 


p 


where A and B are constants. Here and below, p runs through the zeros of f(s) 
in the critical strip. In order to get a rigorous proof, Jacques Hadamard (1865- 
1963), in the 1890s, developed the theory of entire functions of finite order, the 
distribution of their zeros, and their product representations. Hadamard’s proof 
of (3) dates to 1893. 

4. We can write out an explicit formula for (x) — lix in terms of the zeros of €(s). 
This looks a lot tidier in the form proved by von Mangoldt in 1895. Let A(n) 
denote log p if n is a prime power p*, and A(n) = 0 otherwise. Then, for x ¢ Z, 


where we must group p with p in the sum on the right. Riemann’s sketch proof 
involves an application of Fourier inversion but omits some key details. 


Riemann conjectures early in the paper that all complex zeros of €(s) lie on the 
“critical line” Re s = 5. He says that after a few brief and fruitless attempts to 
prove this, he has put it on one side for the time being, because it did not seem 
to be essential to the immediate object of his investigation. Of course the Riemann 
hypothesis is now one of the best known unsolved problems in all of mathematics. 
We do know that more than 41% of the zeros of the zeta function up to a large height 
T are on the critical line [2]. Riemann makes the assertion [18] that he can prove 
this with asymptotically 100% in place of 41%; this still seems out of reach. 

It certainly seems on reading Riemann’s memoir that he regards the asymptotic 
formula (conjectured by Gauss) 


a(x) ~ lix 
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as a consequence of his investigation, but this theorem was only proved rigorously 
in 1895, by both Hadamard and Charles de la Vallée Poussin (1866-1962). A 
key element of the proof of this theorem, the “prime number theorem,” is the 
demonstration that no zero of €(s) has real part 1. Davenport [3] is the best source 
for the proofs of the results described above. 

There is a remarkable postscript to Riemann’s published work on the zeta 
function. In 1932 Carl Ludwig Siegel [24] published an account of material about 
the zeta function that he had found in the Nachlass. Bear in mind that Dedekind 
and Heinrich Weber had already combed through the Nachlass to find anything that 
could be included, as a coherent result, in their two editions of Riemann’s collected 
works! There are two topics covered in Siegel’s paper. One is an asymptotic formula 
for the computation of the real function 


Z(t) = &ME (5 + i) 


where (ft) is defined so that the sign of ¢ (5 + it) is opposite to that of Z(t). The 
other topic is a new representation of ¢(s) in terms of definite integrals. Riemann 
attained the formula 


N —i0(t)+in/4 ,—t1/2 i ya ti —Nx 
1 e e x2 e dx 
Z(t) = ~22cos(@(t) — tlo + : i 

()= Son (00) — tog”) + Gg em J. I 


n=1 


where Cy is a certain path which begins and ends on the real axis at +00. This is 
amenable to the saddle point method; see Edwards [6, Chap. 9] for a discussion. 
As for the definite integral representation of €(s) mentioned above, this is said 
by Siegel to have been found in the Nachlass in 1926 by Erich Bessel-Hagen 
(1898-1946). Bessel-Hagen, a friend of Siegel, was then working on his habilitation 
at Géttingen. See [6, Chap. 7] for more details. We can certainly say that Riemann 
had obtained privately far more results about the zeta function than can be found 
in his memoir for the Berlin Academy, and that these led him to accurate empirical 
discoveries (such as the location of the first few zeros in the critical strip) that made 
his hypothesis on the zeta function plausible. 

For further study of the zeta function, besides the books of Davenport and 
Edwards mentioned above, there are interesting and well-written books by Titch- 
marsh [25], Ivic [9], and Patterson [13]. 

In summary, we can say that Riemann had tremendous insights in every branch 
of mathematics that he studied in any depth, and that he is quite an awe-inspiring 
figure. It is certainly still worthwhile to read his papers and to try and picture 
how his “fantastic premonitions” (as Dedekind describes them) looked to his 
contemporaries. 
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Ramanujan’s Formula for ¢(27 + 1) 


Bruce C. Berndt and Armin Straub 


Abstract Ramanujan made many beautiful and elegant discoveries in his short 
life of 32 years, and one of them that has attracted the attention of several 
mathematicians over the years is his intriguing formula for €(2n + 1). To be sure, 
Ramanujan’s formula does not possess the elegance of Euler’s formula for ¢(2n), 
nor does it provide direct arithmetical information. But, one of the goals of this 
survey is to convince readers that it is indeed a remarkable formula. In particular, 
we discuss the history of Ramanujan’s formula, its connection to modular forms, as 
well as the remarkable properties of the associated polynomials. We also indicate 
analogues, generalizations and opportunities for further research. 


1 Introduction 


As customary, €(s) = )-°2,n~*,Res > 1, denotes the Riemann zeta function. Let 
B,, r = 0, denote the r-th Bernoulli number. When 7 is a positive integer, Euler’s 
formula 


(—1)""'Bon 


2(2n)! (2x)" () 


§(2n) = 


not only provides an elegant formula for evaluating ¢(2n), but it also tells us of 
the arithmetical nature of ¢(2n). In contrast, we know very little about the odd zeta 
values €(2n + 1). One of the major achievements in number theory in the past 
half-century is R. Apéry’s proof that ¢(3) is irrational [2], but for n > 2, the 
arithmetical nature of ¢(2n + 1) remains open. 
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Ramanujan made many beautiful and elegant discoveries in his short life of 32 
years, and one of them that has attracted the attention of several mathematicians 
over the years is his intriguing formula for €(2n + 1). To be sure, Ramanujan’s 
formula does not possess the elegance of (1), nor does it provide any arithmetical 
information. But, one of the goals of this survey is to convince readers that it is 
indeed a remarkable formula. 


Theorem 1.1 (Ramanujan’s Formula for ¢(2n + 1)) Let B,, r > 0, denote the 
r-th Bernoulli number. If « and B are positive numbers such that aB = 1, and ifn 
is a positive integer, then 


—n 1 = 1 
a (5004 1) + x wae] 


m=1 


—n 1 ~ J 
—() (50 2, | 


m=1 


n+1 B B 
= 2?" ye 2 2n+2—2k nti-k pk 5 
do Qb! Qn+2—26!° B (2) 


Theorem 1.1 appears as Entry 21(i) in Chap. 14 of Ramanujan’s second notebook 
[58, p. 173]. It also appears in a formerly unpublished manuscript of Ramanujan that 
was published in its original handwritten form with his lost notebook [59, formula 
(28), pp. 319-320]. 

The purposes of this paper are to convince readers why (2) is a fascinating 
formula, to discuss the history of (2) and formulas surrounding it, and to discuss the 
remarkable properties of the polynomials on the right-hand side of (2). We briefly 
point the readers to analogues and generalizations at the end of our paper. 

In Sect. 2, we discuss Ramanujan’s aforementioned unpublished manuscript and 
his faulty argument in attempting to prove (2). Companion formulas (both correct 
and incorrect) with the same parentage are also examined. In the following Sect. 3, 
we offer an alternative formulation of (2) in terms of hyperbolic cotangent sums. 
In Sects. 4 and 5, we then discuss a more modern interpretation of Ramanujan’s 
identity. We introduce Eisenstein series and their Eichler integrals, and observe that 
their transformation properties are encoded in (2). In particular, this leads us to a 
vast extension of Ramanujan’s identity from Eisenstein series to general modular 
forms. 

In a different direction, (2) is a special case of a general transformation formula 
for analytic Eisenstein series, or, in another context, a general transformation 
formula that greatly generalizes the transformation formula for the logarithm of 
the Dedekind eta function. We show that Euler’s famous formula for ¢(2n) arises 
from the same general transformation formula, and so Ramanujan’s formula (2) is a 
natural analogue of Euler’s formula. All of this is discussed in Sect. 6. 
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In Sect. 7, we discuss some of the remarkable properties of the polynomials that 
appear in (2). These polynomials have received considerable recent attention, with 
exciting extensions by various authors to other modular forms. We sketch recent 
developments and indicate opportunities for further research. We then provide in 
Sect. 8 a brief compendium of proofs of (2), or its equivalent formulation with 
hyperbolic cotangent sums. 


2 Ramanujan’s Unpublished Manuscript 


The aforementioned handwritten manuscript containing Ramanujan’s formula for 
€(2n + 1) was published for the first time with Ramanujan’s lost notebook [59, 
pp. 318-321]. This partial manuscript was initially examined in detail by the 
first author in [12], and by Andrews and the first author in their fourth book on 
Ramanujan’s lost notebook [1, Chap. 12, pp. 265-284]. 

The manuscript’s content strongly suggests that it was intended to be a con- 
tinuation of Ramanujan’s paper [55], [57, pp. 133-135]. It begins with paragraph 
numbered 18, giving further evidence that it was intended to be a part of [55]. 
One might therefore ask why Ramanujan did not incorporate this partial manuscript 
in his paper [55]. As we shall see, one of the primary claims in his manuscript 
is false. Ramanujan’s incorrect proof arose from a partial fraction decomposition, 
but because he did not have a firm grasp of the Mittag—Leffler Theorem, he was 
unable to discern his mis-application of the theorem. Most certainly, Ramanujan 
was fully aware that he had indeed made a mistake, and consequently he wisely 
chose not to incorporate the results in this manuscript in his paper [55]. Ramanu- 
jan’s mistake arose when he attempted to find the partial fraction expansion of 
cot(./wa) coth( JwB ). We now offer Ramanujan’s incorrect assertion from his 
formerly unpublished manuscript. 


Entry 2.1 (p. 318, Formula (21)) [fa and B are positive numbers such that aB = 
x’, then 


CO 


ES re y ma coth(ma) re mp coth(mB) _ z oie) sot wBy. 3) 


2w wt+ma w—mB 


m=1 


We do not know if Ramanujan was aware of the Mittag—Leffler Theorem. 
Normally, if we could apply this theorem to cot(,/wa) coth(,/w), we would let 
w — oo and conclude that the difference between the right- and left-hand sides 
of (3) is an entire function that is identically equal to 0. Of course, in this instance, 
we cannot make such an argument. We now offer a corrected version of Entry 2.1. 


16 B.C. Berndt and A. Straub 


Entry 2.2 (Corrected Version of (3)) Under the hypotheses of Entry 2.1, 


. cot(./wa) coth(./wB) — = a ; log p (4) 


a 3S ma coth(ma) i. mB coth(mB) 


“= ( w+mo w—mp 


Shortly after stating Entry 2.1, Ramanujan offers the following corollary. 


Entry 2.3 (p. 318, Formula (23)) [fa and B are positive numbers such that aB = 
x, then 


= m = m a+Bp 1 
yee = = 5 
«2 gma tPh 2, opt ©) 


m=1 m=1 


To prove (5), it is natural to equate coefficients of 1/w on both sides of (3). 
When we do so, we obtain (5), but without the term —} on the right-hand side of (5) 
[1, pp. 274-275]. Ramanujan surely must have been perplexed by this, for he had 
previously proved (5) by other means. In particular, Ramanujan offered Entry 2.3 as 
Corollary (i) in Sect. 8 of Chap. 14 in his second notebook [58], [10, p. 255]. 

Similarly, if one equates constant terms on both sides of (4), we obtain the 


familiar transformation formula for the logarithm of the Dedekind eta function. 


Entry 2.4 (p. 320, Formula (29)) [fa and B are positive numbers such that aB = 
1, then 
Co Co 
1 1 1 a a-—Bp 
= = —] = : 6 
» m(e2me = 1) » m(e2mb = 1) 4 08 B 12 ( ) 


m=1 m=1 


Of course, if we had employed (3) instead of (4), we would not have obtained the 
expression 7 log S on the right-hand side of (6). Entry 2.4 is stated by Ramanujan 
as Corollary (11) in Sect. 8 of Chap. 14 in his second notebook [58], [10, p. 256] and 
as Entry 27(iii) in Chap. 16 of his second notebook [58], [11, p. 43]. 

In contrast to Entries 2.3 and 2.4, Ramanujan’s formula (2) can be derived 
from (3), because we equate coefficients of w”, n > 1, on both sides, and so the 
missing terms in (3) do not come into consideration. We now give the argument 
likely given by Ramanujan. (Our exposition is taken from [1, p. 278].) 


Proof Return to (3), use the equality 


cothx = 1+ 


e2x = 


and expand the summands into geometric series to arrive at 
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Following Ramanujan, we equate coefficients of w”, n > 1, on both sides of (8). On 
the right side, the coefficient of w” equals 


5 cot( ywer) coth( yw) = ure 


lore) 
=, —n 1 
(—a@) f(2n + 1) + 2(—a) ~ ment (e2ma = 1) 
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(9) 


Using the Laurent expansions for cot z and cothz about z = 0, we find that on the 
left side of (8), 


CO (_1)k92k 
5 cot ver) coth(./wB) = ; 2 (=1)'2" Bax ( war) is 2A ae py, 
(=0 


(2k)! (2j)! 
(10) 
The coefficient of w” in (10) is easily seen to be equal to 
n+1 B 
72nt1 2 a Box 2n+2—2k ti bperiak (11) 


(2k)! (2n + 2— 2k)! 


where we used the equality a8 = 2*. Now equate the expressions in (9) and (11), 
then multiply both sides by (—1)” ‘, and lastly replace k by n + 1 — k in the finite 
sum. The identity (2) immediately follows. Oo 

As mentioned in Sect. 1, Ramanujan also recorded (2) as Entry 21(i) of Chap. 14 
in his second notebook [58, p. 173], [10, p. 271]. Prior to that on page 171, he 
offered the partial fraction decomposition 


co oo 

th uw 
a xy cot(sx) coth(zry) =1+ 2mxy ) noe _ on xy) n co pee) 
Wey 


n=1 n=1 


’ 


a 

(12) 
which should be compared with (3). The two infinite series on the right side 
of (12) diverge individually, but when combined together into one series, the series 
converges. Thus, most likely, Ramanujan’s derivation of (2) when he recorded Entry 
21(i) in his second notebook was similar to the argument that he used in his partial 
manuscript. 
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Sitaramachandrarao [69] modified Ramanujan’s approach via partial fractions to 
avoid the pitfalls experienced by Ramanujan. Sitaramachandrarao established the 
partial fraction decomposition 


2 
mxy cot(sx) coth(ay) = 1+ 50° —x’) (13) 


‘ 3 y* coth(zmx/y) — x* coth(amy/x) 
ine ; 
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Using the elementary identities 
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and then employing (7), he showed that 
a2 
mxy cot(zx) coth(sry) = 1 + mala =2 (14) 
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Setting mx = ./wa and ry = JwB above and invoking the transformation formula 
for the logarithm of the Dedekind eta function from Entry 2.4, we can readily 
deduce (4). For more details, see [1, pp. 272-273]. 

The first published proof of (2) is due to Malurkar [49] in 1925-1926. Almost 
certainly, he was unaware that (2) can be found in Ramanujan’s notebooks [58]. 

If we set a = B = x and replace n by 2n + 1 in (2), we deduce that, for n > 0, 


2n+2 cae) —4n—3 
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This special case is actually due to Lerch [44] in 1901. The identity (15) is a 
remarkable identity, for it shows that €(4n + 3) is equal to a rational multiple of 
x*"*3 plus a very rapidly convergent series. Therefore, €(4n + 3) is “almost” a 
rational multiple of 24”"*3. 


3 An Alternative Formulation of (2) in Terms of Hyperbolic 
Cotangent Sums 


If we use the elementary identity (7), we can transform Ramanujan’s identify (2) for 
€(2n + 1) into an identity for hyperbolic cotangent sums, namely, 


—n YO coth(am) y th(Bm) 
“ pe eH = =i) > = a 
m=1 


n+l 


Bon+2—2k 
_72nt1 1* Box 2n+2—2 n+1-k ke 16 
Le GbiQnt2—-2I ?- (16) 


where, as before, 7 is a positive integer and aB = x7. To the best of our knowledge, 
the first recorded proof of Ramanujan’s formula (2) in the form (16) was by 
Nanjundiah [52] in 1951. If we replace n by 2n + 1 and seta = f = 7, then (16) 
reduces to 


2n+2 


ys coth(zm) — 24nt2 7 4n+3 2 pt! Box Ban+4—2k (17) 
mant3 (2k)! (4n + 4 — 2k)! 


m=1 


This variation of (15) was also first established by Lerch [44]. Later proofs were 
given by Watson [72], Sandham [64], Smart [70], Sayer [65], Sitaramachandrarao 
[69], and the first author [7, 8]. The special cases n = 0 and n = | are Entries 
25(i), (ii) in Chap. 14 of Ramanujan’s second notebook [58, p. 176], [10, p. 293]. 
He communicated this second special case in his first letter to Hardy [57, p. xxvi]. 

Significantly generalizing an idea of Siegel in deriving the transformation 
formula for the Dedekind eta function [68], Kongsiriwong [41] not only estab- 
lished (16), but he derived several beautiful generalizations and analogues of (16). 

Deriving a very general transformation formula for Barnes’ multiple zeta 
function, Komori, Matsumoto, and Tsumura [40] established not only (16), but also 
a plethora of further identities and summations in closed form for series involving 
hyperbolic trigonometric functions. 


20 B.C. Berndt and A. Straub 
4 Eisenstein Series 


A more modern interpretation of Ramanujan’s remarkable formula, which is 
discussed, for instance, in [30], is that (2) encodes the fundamental transformation 
properties of Eisenstein series of level 1 and their Eichler integrals. The goal of this 
section and the next is to explain this connection and to introduce these modular 
objects starting with Ramanujan’s formula. As in [26] and [30], set 


F,(2) = Ye (ner, a(n) = dt. (18) 


Observe that, with g = e’”, we can express F,,(z) as a Lambert series 


CO —a 


oO apn faa 
A =>-(ae|¢ =e =e 


n 
n=1 \ dln d=1m=1 n=1 7 n=1 


in the form appearing in Ramanujan’s formula (2). Indeed, if we let z = ai/z, then 
Ramanujan’s formula (2) translates into 


SEO + Fant} = zn | SO Fant (-;)} 
Zz 


\Im+1 m+i 
(277i) o Boy, Bom—2n+2 2n 


2g & (2n)! (2m—2n+ ar (19) 


This generalization to values z in the upper half-plane # = {z € C : Im(z) > 0} 
was derived by Grosswald in [25]. 

Ramanujan’s formula becomes particularly simple if the integer m satisfies m < 
—1. In that case, with k = —2m, Eq. (19) can be written as 


E,(z) = 7 *E,(-1/2), (20) 


with 


BiG) = + ial) = 1+ a5 nq’. QI) 


n=1 


The series E;, for even k > 2, are known as the normalized Eisenstein series 
of weight k. They are fundamental instances of modular forms. A modular form 
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of weight k and level | is a function G on the upper half-plane .#, which is 
holomorphic on # (and as z — ico), and transforms according to 


(cz + d) *G(Vz) = G(2) (22) 


for each matrix 


v= k 4 € SL>(Z). 


Here, as usual, SL2(Z) is the modular group of integer matrices V with determi- 
nant 1, and these act on # by fractional linear transformations as 


az+b 


23 
eg (23) 


a—— 


Modular forms of higher level are similarly defined and transform only with respect 
to certain subgroups of SL2(Z). Equation (20), together with the trivial identity 
Ex(z + 1) = E;(z), establishes that E;,(z) satisfies the modular transformation (22) 
for the matrices 


It is well known that S and T generate the modular group SL2(Z), from which it 
follows that £;,(z) is invariant under the action of any element in SL2(Z). In other 
words, E;(z) satisfies (22) for all matrices in SL2(Z). 

To summarize our discussion so far, the cases m < —1 of Ramanujan’s formula (2) 
express the fact that, for even k > 2, the q-series E,(z), defined in (21) as the 
generating function of sums of powers of divisors, transforms like a modular form 
of weight k. Similarly, the case m = —1 encodes the fact that 


[o,@) 
Ex(z) = 1-24) ° oy (n)q", 
n=1 


as defined in (21), transforms according to 
=) 6 
E,(2) = z “Ey(—1/z) — —. 
Wiz 


This is an immediate consequence of Ramanujan’s formula in the form (19). The 
weight 2 Eisenstein series E2(z) is the fundamental instance of a quasimodular form. 

In the next section, we discuss the case m > 0 of Ramanujan’s formula (2) and 
describe its connection with Eichler integrals. 
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5 Eichler Integrals and Period Polynomials 


If f is a modular form of weight k and level 1, then 


zf (-;) —f(z) = 90. 
z 


The derivatives of f, however, do not in general transform in a modular fashion 
by themselves. For instance, taking the derivative of this modular relation and 
rearranging, we find that 


ie (-;) -f'@= Kr, 
Zz x 


This is a modular transformation law only if k = 0, in which case the derivative 
f' (or, the —Ist integral of f) transforms like a modular form of weight 2. 
Remarkably, it turns out that any (k— 1)st integral of f does satisfy a nearly modular 
transformation rule of weight k—2(k—1) = 2—k. Indeed, if F is a (k—1)st primitive 
of f, then 


ZF (-;) — F(z) (24) 


is a polynomial of degree at most k — 2. The function F is called an Eichler integral 
of f and the polynomials are referred to as period polynomials. 
Let, as usual, 


1d d 


~ Onidz dq’ 


Remark 5.1 That (24) is a polynomial can be seen as a consequence of Bol’s identity 
[16], which states that, for all sufficiently differentiable F and V = [2 2] € SL2(R), 


(D‘"'F)(Vz) 


= Dt" [(cz + d)* °F(V2)]. 25 
=e. [(cz + FV) 25) 
Here, Vz is as in (23). If F is an Eichler integral of a modular form f = D‘~'F of 
weight k, then 


: . (DIFVV) 
Dee + dF) — FON = a pe OO = 0, 


for all V € SL2(Z). This shows that (cz + d)'?F(Vz) — F(z) is a polynomial of 
degree at most k — 2. A delightful explanation of Bol’s identity (25) in terms of 
Maass raising operators is given in [45, IV. 2]. 
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Consider, for m > 0, the series 
[oe 
Fom+i (z) = > O-2m-1 (n)q", 
n=1 


which was introduced in (18) and which is featured in Ramanujan’s formula (19). 
Observe that 


s fore) 
3 uaa ere (z) _ > » qo} nent gn = > Orm4i(n)q", 


n=1 d\n n=1 


and recall from (21) that this is, up to the missing constant term, an Eisenstein 
series of weight 2m + 2. It thus becomes clear that Ramanujan’s formula (19), in 
the case m > 0, encodes the fundamental transformation law of the Eichler integral 
corresponding to the weight 2m + 2 Eisenstein series. 


Remark 5.2 That the right-hand side of (19) is only almost a polynomial is due 
to the fact that its left-hand side needs to be adjusted for the constant term of the 
underlying Eisenstein series. To be precise, integrating the weight k = 2m + 2 
Eisenstein series (21), slightly scaled, k — 1 times, we find that 


€(—1 — 2m) (Qmiz)?"*! 


Gom41(Z) 2= 5) Qm +1)! 


+ Fom41 (2) (26) 


is an associated Eichler integral of weight —2m. Keeping in mind the evaluation 
€(—1 — 2m) = —Bon+2/(2m + 2), Ramanujan’s formula in the form (19) therefore 
can be restated as 


€(2m + 1) 


2m 
5 d=z7") 


1 
CHAS (-:) — Gom4+1(@) = 
Cri) ” Bon Bom—2n+2 


2n—1 | 27 
2 &+ (2n)! Qm—2n +2)!" 27) 


where the right-hand side is a polynomial of degree k — 2 = 2m. Compare, for 
instance, [74, Eq. (11)]. 

An interesting and crucial property of period polynomials is that their coefficients 
encode the critical L-values of the corresponding modular form. Indeed, consider a 
modular form 


CO 


f@) = Yoana" 


n=0 


of weight k. For simplicity, we assume that f is modular on the full modular group 
SL,(Z) (for the case of higher level, we refer to [54]). Let 
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a: Or . 1 a(n) ,, 
R= (k— + Grae 3 1f 


be an Eichler integral of f. Then, a special case of a result by Razar [62] and Weil 
[73] (see also [30, (8)]) shows the following. 


Theorem 5.3 [ff and f* are as described above, then 


int A 3a se Fee et ee 
9 (2) Lf 0-year * _" 
= 


where, as usual, L(s,f) = Bee 1 ayn * 


Example 5.4 For instance, if 


f@ = smi 'C( — KEx(@) = (277i) jeu —k)+ Yasin ; 


n=1 
where E; is the Eisenstein series defined in (21), then 
zl 
f@= 2m Sy aay DI + Fail) 


equals the function G2,41(z), where k = 2m + 2, used earlier in (26). Observe that 
the L-series of the Eisenstein series f(z) is 


L(s,f) = Qi") mn) = (2ni)"g(s)E(1—k +5). 


n=1 


The result (28) of Razar and Weil therefore implies that 


k-2 aa ? ; 
gg (-:) -~f*@ oe (272i o(k = 1 DEED (29) 
Zz 


! 
j=0 J 


and the right-hand side of (29) indeed equals the right-hand side of (27), which we 
obtained as a reformulation of Ramanujan’s identity. In particular, we see that (28) 
is a vast generalization of Ramanujan’s identity from Eisenstein series to other 
modular forms. 
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6 Ramanujan’s Formula for ¢(27 + 1) and Euler’s Formula 
for ¢(2n) are Consequences of the Same General Theorem 


Ramanujan’s formula (2) for €(2n + 1) is, in fact, a special instance of a general 
transformation formula for generalized analytic Eisenstein series or, in another 
formulation, for a vast generalization of the logarithm of the Dedekind eta function. 
We relate one such generalization due to the first author [5] and developed in [8], 
where a multitude of special cases are derived. Throughout, let z be in the upper 
half-plane #, and let V = [¢ | be an element of SL2(Z). That is, a,b,c,d € Z 
and ad — bc = 1. Furthermore, let Vz = (az+ b)/(cz +d), as in (23). In the sequel, 
we assume that c > 0. Let 7; and r2 be real numbers, and let m be an integer. Define 


lo.) 
A(z, —m, ry, rp) = S ~. kom o2aikaztriztr2) (30) 
n>—r| k=1 
and 
A(z, —m, 1,2) += A(z, —m, r1, 72) + (—1)"A(z, —m, —r1, —r2). (31) 


We define the Hurwitz zeta function ¢(s, a), for any real number a, by 


C(s,a) := Yi@ta™, Res > 1. 


n>—a 


For real numbers a, the characteristic function is denoted by A(a). Define R} = 
ar, + cry and Ry = br; + dro, where a,b, c,d are as above. Define 


e(.—mri.m) = tim 2 — Mri (ez + dy (e™E(5, 72) + (8.79) 
s—m (277i) 
4 MRI)(E(s, Ro) + e*E(s, -R)h | (32) 


As customary, B, (x), n => 0, denotes the n-th Bernoulli polynomial, and {x} denotes 
the fractional part of x. Let p := {Ro}c — {Ri }d. Lastly, define 


c m+2 


_ (-))! lez + ay! Ja {Ri jd+p 
h(z, —m, 11,12) = 2, 2 im +2—b)! By ( - ) Boa (“*) : 


where it is understood that if m+ 2 < 0, then h(z, —m, r1,r2) = 0. 
We are now ready to state our general transformation formula [5, p. 498], 
[8, p. 150]. 
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Theorem 6.1 [fz € # and m is any integer, then 


(cz + d)"H(Vz, —m, nr, 12) = H(z,—m, Ri, R) 


+ 29(z,—m, r1, 72) + (27 i)” * hz, —m,1r|,12). 
(33) 


We now specialize Theorem 6.1 by supposing that 7} = rz = O and that Vz = 
—1/z, so that c = 1 and d = 0. Note that, from (30) and (31), 


[o,2) 
kom! 
H(z, —m,0,0) = (1+ (-1)”) >> rh (34) 
k=1 
Theorem 6.2 [fz € # and m is any integer, then 


—m—1 —m—| 


> Kk = ik 
C0 PCW) ) gap Pe) Se ee 
k=1 k=1 


m+2 
; Be) Bn+2-x _ 
2 m+1 _.\k 7 
ae 2  @Hl—00 
(35) 
where 
i — log z, ifm = 0, 
e(z,—m) = ie ; (36) 
{1—(-z)"}S(m+ 1), ifm #0. 
Corollary 6.3 (Euler’s Formula for ¢(2n)) For each positive integer n, 
(—1)""!Bon 2 
2n) = ————_ (22). 37 
((2n) = San) G7) 


Proof Put m = 2n — 1 in (35). Trivially, by (34), H(z,-—2m + 1,0,0) = 0. 
Using (36), we see that (35) reduces to 


(2n)?"(-1)""! 


On)! {Bi (1) Bon — BonBiz"™"}, (38) 


(1+ 2"")£(2n) = 


where we have used the values B,(1) = By, k > 2, and By,4+,; = 0,k => 1. Since 
Bi) = 5 and B; = -5, Euler’s formula (37) follows immediately from (38). O 


Corollary 6.4 (Ramanujan’s Formula for ¢(2n+ 1)) Leta and B denote positive 
numbers such that aB = 1. Then, for each positive integer n, 
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oo p21 kon 1 
r++) 3 sak — 7 = ( ays cnt D4 ie] (39) 


n+1 
an 5° 1k Box = Bon+2—2k git ik gk 
(2k)! (Qn + 2 — 2k)! 


k=0 


Proof Set m = 2n in (35), and let z = im/a. Recall that z7/a = B. If we ag 
both sides by 3 (—B)~", we obtain (39). 


We see from Corollaries 6.3 and 6.4 that Euler’s formula for €(2n) and 
Ramanujan’s formula for ¢(2n+ 1) are natural companions, because both are special 
instances of the same transformation formula. 

We also emphasize that in the foregoing proof, we assumed that n was a positive 
integer. Suppose that we had taken m = —2n, where n is a positive integer. 
Furthermore, ifn > 1, then the sum of terms involving Bernoulli numbers is empty. 
Recall that 


2n 


e(1—2n) = —2 , n>. (40) 
2n 


We then obtain the following corollary. 


Corollary 6.5 Let a and B be positive numbers such that aB = 1m. Then, for any 
integern > 1, 


Corollary 6.5 is identical to Entry 13 in Chap. 14 of Ramanujan’s second 
notebook [58], [10, p. 261]. It can also be found in his paper [56, p. 269], [57, p. 190] 
stated without proof. Corollary 6.5 is also formula (25) in Ramanujan’s unpublished 
manuscript [59, p. 319], [1, p. 276]. 

Of course, by (40), we could regard (41) as a formula for ¢(1 — 2), and so we 
would have a third companion to the formulas of Euler and Ramanujan. The first 
proof of (41) known to the authors is by Rao and Ayyar [60] in 1923, with Malurkar 
[49] giving another proof shortly thereafter in 1925. If we replace n by 2n+ 1 in (41), 
where v1 is a positive integer, and set a = 6B = 7, we obtain the special case 


f2r- 1 


[o.) por! 
ak Gs YD aR sa He py (41) 


oo 
kant 1 Bana2 


erk— 1 4(2n+ 1)’ 


(42) 


k= 


which is due much earlier to Glaisher [24] in 1889. The formula (42) can also be 
found in Sect. 13 of Chap. 14 in Ramanujan’s second notebook [58], [10, p. 262]. 
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There is yet a fourth companion. If we set m = —2 in (35), proceed as in the 
proof of Corollary 6.4, and use (40), we deduce the following corollary, which we 
have previously recorded as Entry 2.3, and which can be thought of as “a formula 
for €(—1).” 


Corollary 6.6 Let a and f be positive numbers such that a8 = 1*. Then 
CO 


k = a+p 1 
«Daa +P ae = oa oe (43) 


Ifa = B = 2, (43) reduces to 


loo) 


3 e i 4 na 
Lo ek] 4 8" 


Both the special case (44) and the more general identity (43) can be found in 
Ramanujan’s notebooks [58, vol. 1, p. 257, no. 9; vol. 2, p. 170, Corollary 1], [10, 
pp. 255-256]. However, in 1877, Schlémilch [66], [67, p. 157] apparently gave the 
first proof of both (44) and (43). 

In conclusion of Sect. 6, we point out that several authors have proved general 
transformation formulas from which Ramanujan’s formula (39) for ¢(2n + 1) can 
be deduced as a special case. However, in most cases, Ramanujan’s formula was 
not explicitly recorded by the authors. General transformation formulas have been 
proved by Guinand [28, 29], Chandrasekharan and Narasimhan [18], Apostol [3, 4], 
Mikolas [50], Iseki [31], Bodendiek [14], Bodendiek and Halbritter [15], Glaeske 
[22, 23], Bradley [17], Smart [70], and Panzone, Piovan, and Ferrari [53]. Guinand 
[28, 29] did state (39). Due to a miscalculation, ¢(2n+ 1) did not appear in Apostol’s 
formula [3], but he later [4] realized his mistake and so discovered (39). Also, recall 
that in Sect. 3, we mentioned the very general transformation formula for multiple 
Barnes zeta functions by Komori, Matsumoto, and Tsumura [40] that contains 
Ramanujan’s formula (39) for ¢(2n + 1) as a special case. Lastly, in the beautiful 
work of Katsurada [38, 39] on asymptotic expansions of q-series, Ramanujan’s 
formula (2) arises as a special case. 

We also have not considered further formulas for €(2n + 1) that would arise from 
the differentiation of, e.g., (33) and (35) with respect to z; see [8]. 


7 The Associated Polynomials and Their Roots 


In this section we discuss the polynomials that are featured in Ramanujan’s 
formula (2) and discuss several natural generalizations. These polynomials have 
received considerable attention in the recent literature. In particular, several papers 
focus on the location of zeros of these polynomials. We discuss some of the 
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developments starting with Ramanujan’s original formula and indicate avenues for 
future work. 
Following [30] and [51], define the Ramanujan polynomials 


m+1 
Boy Bom— 2n+2 2n 


Rom+1 (z) _ > (2n)! (2m —2n+ yi 


n=0 


(45) 


to be the polynomials appearing on the right-hand side of Ramanujan’s formula (2) 
and (19). The discussion in Sect. 5 demonstrates that the Ramanujan polynomials 
are, essentially, the odd parts of the period polynomials attached to Eisenstein series 
of level 1. In [51], Murty, Smyth, and Wang proved the following result on the zeros 
of the Ramanujan polynomials. 


Theorem 7.1 Form > 0, all nonreal zeros of the Ramanujan polynomials Ryn-+\ (Z) 
lie on the unit circle. 

Moreover, it is shown in [51] that, for m > 1, the polynomial R»,,4+)(z) has 
exactly four real roots and that these approach +2*!, as m — oo. 


Example 7.2 As described in [51], one interesting consequence of this result is 
that there exists an algebraic number a € .# with |a| = 1, a” 1, such that 
Rom+ (a) = 0 and, consequently, 


C(2m +1) Fom+i (ot) — 07" Fom+1 (—3) 
2 — qn — | ? 


(46) 


where F is as in (18). In other words, the odd zeta values can be expressed as 
the difference of two Eichler integrals evaluated at special algebraic values. An 
extension of this observation to Dirichlet L-series is discussed in [13]. Equation (46) 
follows directly from Ramanujan’s identity, with z = a, in the form (19). 
Remarkably, Eq. (46) gets close to making a statement on the transcendental nature 
of odd zeta values: Gun, Murty, and Rath prove in [30] that, as w ranges over all 
algebraic values in .# with a?” 4 1, the set of values obtained from the right-hand 
side of (46) contains at most one algebraic number. 

As indicated in Sect. 5, the Ramanujan polynomials are the odd parts of the 
period polynomials attached to Eisenstein series (or, rather, period functions; see 
Remark 5.2 and [74]). On the other hand, it was conjectured in [42], and proved in 
[43], that the full period polynomial is in fact unimodular, that is, all of its zeros lie 
on the unit circle. 


Theorem 7.3 For m > 0, all zeros of the polynomials 


€(2m + 1) 


2m+1 
(Q7i)2m41 (z a) 


Rom4i(Z) + 


lie on the unit circle. 
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An analogue of Theorem 7.1 for cusp forms has been proved by Conrey, Farmer, 
and Imamoglu [19], who show that, for any Hecke cusp form of level 1, the odd 
part of its period polynomial has trivial zeros at 0, +2*! and all remaining zeros lie 
on the unit circle. Similarly, El-Guindy and Raji [20] extended Theorem 7.3 to cusp 
forms by showing that the full period function of any Hecke eigenform of level 1 
has all its zeros on the unit circle. 


Remark 7.4 In light of (27) it is also natural to ask whether the polynomials 


P (2) := = f(2m + Dia a2") _ Oh) aude . Boy, Bom—2n+2 pri 
. 2 2 & (2n)! (2m — 2n + 2)! 

are unimodular. Numerical evidence suggests that these period polynomials p,,(z) 

are indeed unimodular. Moreover, let p;,(z) be the odd part of p,,(z). Then, the 

polynomials p;, (z)/z also appear to be unimodular. It seems reasonable to expect 

that these claims can be proved using the techniques used for the corresponding 

results in [51] and [43]. We leave this question for the interested reader to pursue. 
Very recently, Jin, Ma, Ono, and Soundararajan [32] established the following 

extension of the result of El-Guindy and Raji to cusp forms of any level. 


Theorem 7.5 For any newform f € S,(Io(N)) of even weight k and level N, all 
zeros of the period polynomial, given by the right-hand side of (28), lie on the circle 
Iz] = 1/VN. 

Extensions of Ramanujan’s identity, and some of its ramifications, to higher level 
are considered in [13]. Numerical evidence suggests that certain polynomials arising 
as period polynomials of Eisenstein series again have all their roots on the unit 
circle. Especially in light of the recent advance made in [32], it would be interesting 
to study period polynomials of Eisenstein series of any level more systematically. 
Here, we only cite one conjecture from [13], which concerns certain special period 
polynomials, conveniently rescaled, and suggests that these are all unimodular. 


Conjecture 7.6 For nonprincipal real Dirichlet characters y and w, the general- 
ized Ramanujan polynomial 


k B, B - 1 k—s—1 
RG iw = oa (: 7 ) a—2}) (47) 
s=0 


is unimodular, that is, all its roots lie on the unit circle. 
Here, B,,, are the generalized Bernoulli numbers, which are defined by 


co 


Bn = (48) 


n=0 


if x is a Dirichlet character modulo L. If y and yw are both nonprincipal, then 
Ri.(z; x, W) is indeed a polynomial. On the other hand, as shown in [13], if 
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k > 1, then Ro;(z; 1,1) = Rox—1(z)/z, that is, despite their different appearance, 
the generalized Ramanujan polynomials reduce to the Ramanujan polynomials, 
introduced in (45), when y = 1 and y = 1. For more details about this conjecture, 
we refer to [13]. 


8 Further History of Proofs of (2) 


In this concluding section, we shall offer further sources where additional proofs of 
Ramanujan’s formula for (27 + 1) can be found. We emphasize that two papers 
of Grosswald [25, 27], each proving formulas from Ramanujan’s second notebook, 
with the first giving a proof of (2), stimulated the first author and many others to 
seriously examine the content of Ramanujan’s notebooks [58]. Katayama [36, 37], 
Riesel [63], Rao [61], VepStas [71], Zhang [75], and Zhang and Zhang [76] have also 
developed proofs of (2). Ghusayni [21] has used Ramanujan’s formula for ¢(2n + 1) 
for numerical calculations. 

For further lengthy discussions, see the first author’s book [10, p. 276], his survey 
paper [6], his account of Ramanujan’s unpublished manuscript [12], and his fourth 
book with Andrews [1, Chap. 12] on Ramanujan’s lost notebook. These sources 
also contain a plethora of references to proofs of the third and fourth companions to 
Ramanujan’s formula for €(2n + 1). Another survey has been given by Kanemitsu 
and Kuzumaki [33]. 

Also, there exist a huge number of generalizations and analogues of Ramanujan’s 
formula for ¢(2n + 1). Some of these are discussed in Berndt’s book [10, p. 276] 
and his paper [9]. Lim [46-48] has established an enormous number of beautiful 
identities in the spirit of the identities discussed in the present survey. Kanemitsu, 
Tanigawa, and Yoshimoto [34, 35], interpreting Ramanujan’s formula (2) as a 
modular transformation, derived further formulas for €(2n + 1), which they have 
shown lead to a rapid calculation of ¢(3) and ¢(5), for example. 

In Sects. 4 and 5 we have seen that Ramanujan’s formula can be viewed as 
describing the transformation laws of the modular Eisenstein series E>,, where 
k > 1, of level 1 (that is, with respect to the full modular group), the quasimodular 
Eisenstein series E> as well as their Eichler integrals. We refer to [13] for extensions 
of these results to higher level. 
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Towards a Fractal Cohomology: Spectra of 
Polya—Hilbert Operators, Regularized 
Determinants and Riemann Zeros 


Tim Cobler and Michel L. Lapidus 


Abstract Emil Artin defined a zeta function for algebraic curves over finite fields 
and made a conjecture about them analogous to the famous Riemann hypothesis. 
This and other conjectures about these zeta functions would come to be called the 
Weil conjectures, which were proved by Weil in the case of curves and eventually, 
by Deligne in the case of varieties over finite fields. Much work was done in the 
search for a proof of these conjectures, including the development in algebraic 
geometry of a Weil cohomology theory for these varieties, which uses the Frobenius 
operator on a finite field. The zeta function is then expressed as a determinant, 
allowing the properties of the function to relate to the properties of the operator. 
The search for a suitable cohomology theory and associated operator to prove the 
Riemann hypothesis has continued to this day. In this paper we study the properties 
of the derivative operator D = 4 on a particular family of weighted Bergman 
spaces of entire functions on C. The operator D can be naturally viewed as the 
“infinitesimal shift of the complex plane” since it generates the group of translations 
of C. Furthermore, this operator is meant to be the replacement for the Frobenius 
operator in the general case and is used to construct an operator associated with any 
given meromorphic function. With this construction, we show that for a wide class 
of meromorphic functions, the function can be recovered by using a regularized 
determinant involving the operator constructed from the meromorphic function. 
This is illustrated in some important special cases: rational functions, zeta functions 
of algebraic curves (or, more generally, varieties) over finite fields, the Riemann 
zeta function, and culminating in a quantized version of the Hadamard factorization 
theorem that applies to any entire function of finite order. This shows that all of 
the information about the given meromorphic function is encoded into the special 
operator we constructed. Our construction is motivated in part by work of Herichi 
and the second author on the infinitesimal shift of the real line (instead of the 
complex plane) and the associated spectral operator, as well as by earlier work and 
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conjectures of Deninger on the role of cohomology in analytic number theory, and 
a conjectural “fractal cohomology theory” envisioned in work of the second author 
and of Lapidus and van Frankenhuijsen on complex fractal dimensions. 


1 Introduction 


Riemann’s famous paper [31] opened up the use of complex analysis to study the 
prime numbers. This approach has yielded many great results in number theory, 
including, but certainly not limited to, the Prime Number Theorem. Riemann also 
made his well-known conjecture that stands to this very day. We will refer to this as 
(RH) in this paper. 


Conjecture (RH) The only nontrivial zeros of €(s) occur when s satisfies R(s) = 5. 

For further reading about €(s), see [30, 37], and [13]. However, despite (RH) 
remaining unsolved over 150 years after it was made, there is an analogue for 
zeta functions of algebraic varieties over finite fields that has been proven. The 
development of this theory also introduced new techniques to number theory. We 
will begin with a short history of this result. 


1.1 The Weil Conjectures 


Using the Euler product representation of the Riemann zeta function in terms of the 
rational primes, 


és) =[[a-p°y, (1) 
Pp 


as a template, it is possible to define the zeta function of an algebraic curve over a 
finite field as follows. 


Definition 1 Let Y be a smooth, geometrically connected curve over F,,, the finite 
field with g elements. Then the zeta function of Y is given by 


ts) = [J a-lKly (2) 


yelY| 


where |Y| is the set of closed points of Y and |k,| is the size of the residue field of y. 

This formulation of the zeta function of an algebraic curve over a finite field 
shows the analogy with Riemann’s zeta function, but we will prefer the following 
equivalent expression Cy(s) = exp(S~-2, q~™), where Y, is the number of 
points of Y defined over Fx, the degree n extension of F,. The study of these zeta 


functions began in 1924 in Emil Artin’s PhD thesis [1]. These were further studied 
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by Schmidt, who proved, in 1931, that ¢y(s) was a rational function of g~ in [34], 
and Hasse, who showed, in 1934, in [18], that if Y is an elliptic curve, then the zeros 
of Cy(s) satisfy R(s) = 5. Thus, the corresponding version of (RH) holds for these 
zeta functions of elliptic curves over finite fields. Furthering this idea, Weil then 
proved, in 1946-1948, that this same version of (RH) holds for algebraic curves of 
arbitrary genus and for abelian varieties in [40]. (See also [38, 39] and [41].) Below 
we present a sketch of some of the ideas contained in a modern proof of these results, 
which are based on Weil’s ideas, and will motivate the work contained in this paper. 
First, a sequence of the so-called Weil cohomology groups for the curve Y are 
formed, in particular H°, H', H? are the only nontrivial groups, with dimH°® = 
dim H? = 1 and dimH! = 2g where g denotes the genus of Y. Then the Frobenius 
map F which sends y — y% acts on the space F, for any n and therefore induces 
a morphism of the curve Y over F, (the algebraic closure of F,) as well as, in fact, 
also induces a linear map on the cohomology groups F* : H/ — H/, forj € {0, 1, 2}. 
Next, consider the Lefschetz fixed point formula from topology. 


Theorem 1 (Lefschetz Fixed Point Formula) Let Y be a closed smooth manifold 
and let f : Y — Y be a smooth map with all fixed points nondegenerate. Then 
yjo(— 1 Tr(f* |H’) is equal to the number of fixed points of f. 

Note that in Theorem |, since Y is finite-dimensional, only finitely many of the 
cohomology spaces H/ are nontrivial. 

We apply the topological version of this result to the nth power of the Frobenius 
map, F”, whose fixed points are exactly the points on the curve Y defined over Fn. 
That is, all those points with every coordinate in Fj». This gives 


2 
ICD THF" |) = Yn, (3) 


J=0 


where F*|H/ (for j = 0, 1, 2) denotes the linear operator induced on the cohomology 
space H! by the Frobenius morphism F. 
To proceed further, we need the next result from linear algebra. 


Theorem 2 /f f is an endomorphism of a finite-dimensional vector space V, then 
for |t| sufficiently small, exp (SV, 40°Tr(f"|V)) = det —f + t|V)71. 

Applying this result to the Frobenius operator F, we can proceed with the 
following calculation: 


fy(s) = exp (>: a) 


n=1 


[eve) 2 
1 j a7 j\ ,—ns 
= exp y 7 y (—1)Tr(F™ |A’)q 


n=l j=0 
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2 oO 4 . (-1)/ 
= I] (os (>: ae |H’)q “)) 


j=0 
- i al 
= I] (det — F*q~*|H’)) 


_ det — F*q~|H') 
~ det(I — F*g~|H®) det(I — F*q-|H2) 


(4) 


This enables us to express the zeta function of a curve Y as an alternating 
product of characteristic polynomials of the Frobenius operators, or more precisely, 
of determinants of J — g-*F* over the cohomology spaces. Since these spaces are 
finite-dimensional, this equation further shows that ¢(Y, s) is a rational function of 
q *, which yields Schmidt’s result. We also see that the zeros of Cy(s) are given 
from the eigenvalues of the operator F* on H!', while the poles are given from the 
eigenvalues on H® and H?. To complete the proof, it was shown by Weil using the 
intersection theory of divisors to show that the intersection is positive definite that 
the eigenvalues of F* on H! have absolute value q? and thus the zeros of ¢(Y,s) 
satisfy H(s) = ‘ 

Weil then conjectured that all of the above and more could be generalized 
to any non-singular, projective variety of dimension d, defined over F,. About a 
decade later, Alexander Grothendieck announced he would be revamping algebraic 
geometry with the goal of proving these Weil conjectures. Several attempts to 
construct a proper “Weil cohomology” were incomplete, but eventually these 
provided the key idea to the proof of the Weil conjectures. Grothendieck even came 
up with more general conjectures based on this study of what properties a “Weil 
cohomology” must possess. The version of (RH) sought after would then follow 
from these. Some of his work outlining these ideas are [14, 15], and [16]. However, 
Pierre Deligne, a student of Grothendieck, would go on to prove, in 1973, this 
version of (RH) without proving Grothendieck’s “standard conjectures,” which are 
still unproven today. See [8] and [9] for Deligne’s work. Thus, Weil’s conjectures 
were completed as a result of the introduction of, or at the very least, expansion of, 
the use of topology and cohomology in number theory. For a more complete history 
of the Weil conjectures, see [12, 24] and [29]. 


1.2 Polya—Hilbert Operators and a Cohomology Theory in 
Characteristic Zero 


As seen in the previous section, the Frobenius operator became fundamental to the 
resolution of the version of (RH) dealing with algebraic varieties (or even with 
curves) over finite fields. The eigenvalues of this operator on different cohomology 
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groups gave us the zeros and poles of the zeta function of the variety. If such an 
operator could be found for the Riemann zeta function, then perhaps this work 
would extend and help one to prove (RH). However, if you instead consider the 
function ¢ G + if) as a function of t, then (RH) is equivalent to all the nontrivial (or 
critical) zeros of this function being real. This then leads into what is known as the 
Polya—Hilbert conjecture. 


Conjecture (Polya—Hilbert Conjecture) The critical zeros of ¢ (5 + it) correspond 
to the eigenvalues of an unbounded self-adjoint operator on a suitable Hilbert space. 

Since then, motivated in part by the above reformulation, many physicists, 
mathematicians, and mathematical physicists have been looking for a convincing 
physical reason why (RH) should be true. In particular, it has been conjectured 
by Michael Berry in [3] (and several other papers) that a trace formula for a 
suitable (classically chaotic) quantum-mechanical Hamiltonian could formalize this 
connection between the spectrum of an operator and the Riemann zeros. See also 
[4] for a discussion of these ideas. In fact, Alain Connes, in [6], conjectured the 
existence of a suitable noncommutative version of such a trace formula. However, 
as in every other approach to proving (RH), the search for the correct way to make 
this potential approach work continues to this day. 

Building on Alexander Grothendieck’s ideas, Christopher Deninger has postu- 
lated in [10, 11] (and other papers) that the cohomology theory used to prove the 
Weil conjectures could be extended to eventually prove the Riemann hypothesis. In 
particular, he envisions a cohomology theory of algebraic schemes over Spec(Z) that 
would conjecturally help prove the Riemann hypothesis and solve other important 
problems in analytic number theory. In his papers, he lays out some of the difficulties 
in doing so as well as some of the properties that such a theory would need to satisfy. 

We also mention that Shai Haran [17] has obtained interesting trace formulas 
yielding Weil’s explicit formula: that is, of Weil’s interpretation in [42] of Riemann’s 
explicit formula [13, 25, 28, 30, 31, 37]. 


1.3. Fractal Cohomology 


All of the previous ideas as well as separate connections between fractals and the 
Riemann zeta function ¢(s) discussed in [27] and [28] motivated the second author 
to pursue a fractal cohomology to try to tie together all of the ideas presented so far. 
The text [25] outlines his ideas for how the theory of fractals might give information 
about the Riemann zeta function. See also Sect. 12.4 of [28] for a discussion of the 
main properties that such a fractal cohomology theory should satisfy, by analogy 
with the case of varieties over finite fields and self-similar strings. 

In search of the elusive “Frobenius operator in characteristic 0,” the second author 
worked with H. Herichi to develop a “Quantized Number Theory” in [20—23, 26]. 
There, they used an operator they denoted 0, which was the derivative operator 
on a suitable family of Hilbert spaces. This operator had many nice properties, 
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including being a generator for the infinitesimal shift group on these spaces as well 
as having a spectrum consisting of a single vertical line in the complex plane. This 
allowed them to focus on the values of €(s) on R(s) = c for c € (0, 5) or for 
ce ( 5. 1) and obtain a reformulation of (RH) within this theory. This involved 
studying an operator-valued version of ¢(s), which they called a quantized zeta 
function. An overview of these ideas and results can be found in [26], while a 
detailed exposition of the theory is provided in [23]. 

This paper then continues this search of an appropriate substitute for Frobenius 
in characteristic zero. In an attempt to further localize the spectrum of the derivative 
operator, we turn to a family of weighted Bergman spaces, which provide the basis 
for our construction. We will begin by recalling some needed functional analysis 
building up to the regularized determinants that we will need. Then we discuss 
the family of Bergman spaces and the needed properties of the derivative operator 
on them, which allows our construction to work. At this point, we will detail our 
construction to create a Frobenius replacement. This provides a general framework 
to find a substitute for the Frobenius operator which will be shown to apply to 
any entire function of finite order as well as certain meromorphic functions of 
interest such as ¢(s). However, there is still much to be done. We do not have a 
true cohomology theory as we do not have a suitable notion for how to define the 
geometry in our context. We will finish with a discussion of what is lacking from 
this theory and where to go from here. 


2 Background 


This section loosely follows [36] in developing the necessary theory for trace ideals 
and regularized determinants to be used in this paper. See [35, 36] for detailed 
historical notes and references, along with a discussion of the many contributions to 
this subject. 

To describe what trace ideals are, we recall some standard facts about compact 
operators on a separable Hilbert space H. 


Theorem 3 Let A be a compact operator on H. Then there are orthonormal sets 
{Wn} and {,} and positive real numbers [1,(A), with (A) > [2(A) = ---, 
such that A = D>, [n(A)(Wn,-)bn. Moreover, the numbers [1,(A) are uniquely 
determined. 

The positive real numbers jz,(A) from the previous theorem are called the 
singular values of A. We can actually describe {j,(A)} in another way. Given an 
operator A, the operator A*A is a nonnegative operator, so that |A| := /A*A makes 
sense. The j1,(A)’s are exactly the (nonzero) eigenvalues of |A|. We can now turn 
to Calkin’s theory of operator ideals. We begin by setting up a relationship between 
ideals in B(H) and certain sequence spaces. 
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Definition 2 Fix an orthonormal set {¢,} in H. Given an ideal J 4 B(H); we define 
the sequence space associated with J by 


SU) = 44 = (4.42...) Yann. bn Se - (5) 


On the other hand, given a sequence space s, let /(s) be the family of compact 
operators A with ({1;(A), [42(A),...) ES. 

In order for this correspondence between sequence spaces and ideals to be one- 
to-one, we need to restrict our sequence spaces to Calkin spaces. We then need the 
following operator on sequences. 


Definition 3 Given an infinite sequence, (a,,), of numbers with a, > 0 as n — oo, 
a;, is the sequence defined by af = max; |aj|, af + a} = max;z;(|a;| + |aj|), etc. 
Thus aj > aj} > ---, and the sets of a* and |a;| are identical, counting multiplicities. 
This operator allows us to make the following definition. 


Definition 4 A Calkin space is a vector space, s, of sequences (a,) with lim a, = 
noo 


0, and the so-called Calkin property: a € s and b* < a* implies b € s. 
With these definitions in mind, we can use the following theorem to see a relation 
between two-sided ideals and Calkin spaces. 


Theorem 4 ([36]) /fs is a Calkin space, then I(s) is a two-sided ideal of operators 
and S(I(s)) = s. Furthermore, if J is a two-sided ideal, then S(J) is a Calkin space 
and I(S(J)) = J. 

We will now use this relation to define the ideals in the space of compact 
operators that we will be working with. 


Definition 5 A compact operator A is said to be in the trace ideal J,, for some 
p= lif », [n(A)? < oo. That is, J, is the ideal which is associated with the 
Calkin space /?. An element A of J; is called a trace class operator. For A € J, we 
define Tr(A) = )>,,(¢n,Ag¢,) for any choice of orthonormal basis {@,}. If A € Jo, 
then we say that A is Hilbert—Schmidt. 

Trace class operators, A, are precisely those operators for which the expression 
Tr(A) = ¥0,(bn.Agn) is absolutely convergent and independent of the choice 
of orthonormal basis. Similarly, Hilbert-Schmidt operators are those for which 
>, (Abn, Abn) = ||Adn||? is convergent and independent of the choice of orthonor- 
mal basis. If A is a trace class operator, then there is a method to define a so-called 
Fredholm determinant, det(I+zA), which defines an entire function on C. Operators 
of the form / + ZA for a trace class operator A are called Fredholm. This determinant 
can be defined in several equivalent ways. We list them here for trace class A and 
zeC: 


det(I + zA) := eos) (6) 
for small |z| and then analytically continued to the whole complex plane, 


det(I + cA) = D0 A Tr(A*(A)) (7) 


k=0 
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with A*(A) defined in terms of alternating algebras, and 


N(A) 


det(I + <A) = |] +A, (A), (8) 


k=1 


where the complex numbers A;(A) are the nonzero eigenvalues of A and N(A) 
is the number of such eigenvalues, which can be infinite. In the latter case, the 
corresponding infinite product is convergent. 

A discussion concerning which of the above equations should be taken as a 
definition and which are to be proven appears briefly in Chap.3 of [36] and in 
more detail in [35]. For the work here, (8) will be the most convenient choice. One 
thing to note at this time though is that det(J + zA) does define an entire function 
by any of the above definitions, when A is trace class. This then shows why one 
cannot hope to recover a meromorphic function by simply taking the determinant of 
a suitable operator without taking the quotient of such determinants as was seen in 
the discussion of the Weil conjectures. 

Although some of the operators we will consider will not be trace class, they will 
at least be in one of the other trace ideals J,, for some n € N. In this case, we can 
define a regularized determinant that will allow us to get a determinant formula for 
the operator. We start by considering an expression of the form det(J + zA)e~"™), 
For trace class operators A, both det(J + zA) and e~/") are convergent, but for 
Hilbert-Schmidt operators neither is necessarily well defined. And yet, when you 
consider the two factors together as a possibly infinite product over the eigenvalues 
of A, 


N(A) 


[ [ + cA) exp-AA)z) , 


k=1 


the combined term does converge for Hilbert-Schmidt operators. This idea can in 
fact be extended to get a convergent infinite product expression for operators in any 
J,, which will be called the regularized determinant of A. First we need a lemma. 


Lemma 1 ((36]) For A € B(H), let 


n—1 


Ry(A) = | E+ Adexp[ DIF 4’ | | -1. (9) 


j=l 


Then if A € Jn, we have R,(A) € Ji. 
This associates a trace class operator to any given A € J, and allows us to define 
the regularized determinant of A as follows: 
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Definition 6 ((36]) For A € J, define det,,(7 + A) = det(J + R,(A)). 

Note that this definition implies that det;(/ + A) = det(J + A), the usual 
Fredholm determinant. We will use these two notations interchangeably from here 
on. Also with this definition, we can now give a very similar product formula for the 
regularized determinant of a Hilbert-Schmidt operator, with each term having an 
exponential factor to help convergence along with some other interesting properties. 
This corresponds to the n = 2 case of the following result. 


Theorem 5 ([36]) For A € J,, we have 


N(A) n—1 
det, (7+ A) = [] | + pag(A)exp| SOI AAYW |} ]. (10) 
k=1 j=l 


These regularized determinants are related to the usual Fredholm determinant of 
1 +A for trace class operators A in the following fashion. 


Theorem 6 ([36]) For A € Ji, we have 


n—-1 
det, (I + A) = det(I + A) exp > (—1)j7'Tr(A’) ] . (11) 


j=l 


These “regularized determinants,” just as the Fredholm determinants, define an 
entire function and will be key to the precise formulation of our results. This will 
be because our construction will not always create a trace class operator for which 
the standard Fredholm determinant would apply. This will be the case, in particular, 
for the Riemann zeta function, for which the regularized determinant det, will be 
needed; see Theorems 27 and 28 in Sect.5.4 below. However, we will show that 
for any entire function of finite order and for many meromorphic functions, our 
construction will give an operator that is at least in some J, and thus the regularized 
determinant will apply to it. 


3 Derivative Operator on Weighted Bergman Spaces 


The search for an operator to possibly take the place of the Frobenius in the proof 
of the Weil conjectures led us to consider the derivative operator. A treatment 
examining the derivative operator on L?(R, e~*“dt) and its use to create a “quantized 
number theory” can be found in the research monograph [23], as well as in the 
accompanying articles [20-22] and [26]. 

This paper takes a different direction with the derivative operator. We begin by 
following the treatment in [2] in developing properties of the derivative operator on 
a certain family of weighted Bergman spaces. We will then continue beyond their 
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results and use all of this to create an operator that might properly take the place 
of the Frobenius. We begin by recalling the definitions of the spaces we will be 
working with. (See, e.g., [19], for a general reference about Bergman spaces.) 


Definition 7 We define a weight function to be a positive continuous function w 
on C. Then, for 1 < p < ov, we define the weighted L? spaces to be L? (C), the 
space of functions on C such that fw € L? Ce di), where A is the Lebesgue measure 
on R?, and equipped with the norm ||| we. = \lAwilipce2y. Next, denote by BY, the 
subspace of entire functions in L’; then, B? is called a weighted Bergman space of 
entire functions. 

Note that the convention above for functions f € L?, would be those for which 
Jc lfl?weda < oo instead of f,, |f|?wdA < 00. Then we have the following basic 
fact about these spaces. 


Theorem 7 For P. > 1, BP is aclosed subspace of L, and hence is a Banach space. 
Also, for p = 2, B2, is a Hilbert space. 

Now we consider the differential operator D = = on the space B?, and examine 
its properties; including for particular choices of w an p. 

Consider the following types of weight functions: w(z) = e?)) where @ is 
a nonnegative, concave, monotone (i.e., nondecreasing), subadditive function on 
R+ = [0, co) such that w(0) = 0 and 


t 
oO _ “ia 
00 logt — 
We next define 
t 
= ee (13) 
t>+oo f¢ 


We then have the following results in this situation (with No := {0,1,2,...}). 


Theorem 8 ((2]) Let 1 < p < co and w be a weight function with constant a, as 
in (13) above. Then, 


1) The differentiation operator D = < is a bounded linear operator on B?.; 
2) Forallr > 0, and forn € No, we hae the bound ||D"|| < n!r~"e?™, 


Proof We will prove 2) noting that 1) follows from it. Suppose that f € By and 
r > 0. Cauchy’s formula for the n” derivative of f reads as 


a feo+D 
DIG) =o ha. a (14) 
We now consider the case p = ov. Let z,z € C with |z| = r. Then since ¢ is 


subadditive and monotonic, we have: $(|zo + z|) < #(|zol + |zl) < @(zol) + o((zI). 
Also, |[flloow = SUpzec [f(z)|e~?"_ > [F (zo + zle~P*2) by the definition of 
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the norm. This leads to: |f(zo + 2)| < [lf lloo we?(@t) < [If loo we? eh), Then 
by (14), we have for any z € C that 


ID"F@| < nlr™ sup [f (zo + 2)| <a" flo we? Me™. 


|z|=r 


Thus ||D"f loo.» = Ap ID" (z)|e7F 2) < nr" e®™ [If |] 50. Therefore, we have that 


|D"|| < nlrreb(?), Neb we turn to the case 1 < p < oo. Let z € C. Applying 
Holder’s inequality in (14) yields 


|D"f (z)| < 


1 
20 ; Pp 
( If(e+ rena) 
0 


(27)? r” 


This leads to 


i |D"f (z) Pe Poe DdA(z) < < P =f (/. f(z + rel?) [Pe Pole Dan(e)) 8. 
27 r?’ 


By making a change of variable, we can rewrite this as 


1p 2a : 
[rerrerre rane) = 5 [ (f veareree"ance) ao 
Cc 2mrP" Jo c 


(15) 


Using the triangle inequality for ¢, 
JP (zl) — (lz re? /)| < Pre") = O0), 
in the inner integral on the right-hand side of (15), we obtain that 
/ If) Pe Polle—re" I) a) (z) = / F(z) [Pe PEMD er(Olel) (ler D) gq (z) 
Cc Cc 
2 ey f(z) [Pe PP) ad (z) < ePP FIP o 
. 


Applying this estimate to (15) then yields 


i D'folPe 


Thus ||D"fl|pw < n'r"e% |[fllpw, and it follows that ||D"|| < n!re?. This 
concludes the proof of the theorem. 0 

We also have the following result about the spectrum, o (D), of D. (See, e.g., [32] 
for a discussion of spectral theory and the functional calculus in this context.) 


ni? ePe) 


Pp 
—— lw 


pwd = 
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Theorem 9 ([2]) Under the conditions of Theorem 8, the spectrum o(D) is given 
for any a = 0 by 


o(D) = Ag:= {z € C: |z| < a}. (16) 


In particular, if a = 0, then A, = {0}. 


Proof Let e,(z) = e* for A € C. Clearly, we have De, = Ae, and so e, is an 
eigenvector of the operator D with eigenvalue A, as long as e, € B?,. However, if 
|A| < a and we write z = re’ and A = |A|e’®, we have the following: 


lex (z)e PU) | = jelAlePF—907)| = elAlrcos(B+4)-$(r) < o(lal-22)_ (17) 
But by (13), then this function is integrable, so e, € B?, for |A| < a. Thus we 
have A, C o(D). To complete the proof we will show that r(D), the spectral radius 
of D, satisfies the inequality r(D) < a. It suffices to show that r(D) < a+ e 
for any € > 0; so let us fix € > 0. Then again using (13), we see that there is 
to > 0 such that d(t) < (a+ €)t for tf > ft. Thus by part 2 of Theorem 8 we 
have that ||D"|| < Cn!r-"e“@+©" for any r > 0,n = 1,2,..., where C is a constant 
depending only on €. Minimizing this expression with respect to r yields the critical 
value r = 77>. Substituting this choice of r gives ||D"|| < C mete Applying 
Stirling’s formula gives that ||D”"|| < f(n), where f() is asymptotic to a constant 
times ./n(a + €)". Thus we have r(D) = tim. |D" ||" < a+e. We conclude that 
r(D) = a and o(D) = Ag, as desired. O 

To further study this operator, we restrict our attention to the special case when 
p = 2, where we actually have a Hilbert space with inner product given by (f, g) = 
Seofge ela. It is convenient to have a particular simple orthonormal basis to 
deal with, and, since we are dealing with entire functions that are guaranteed to 
have convergent power series, it makes sense to look at polynomials to try to find 
this orthonormal basis. It turns out that all we need are monomials. 


Theorem 10 ((2]) There exist constants Cy, such that {un(z)}, where un(z) = Cn", 
forn € No, forms an orthonormal basis for B?.. 


Proof First note that (z",z”") = 0 ifn 4 m. This follows from a simple calculation 
using the fact that the weight function is radial, so that using polar coordinates, we 
have 


20 lo.) 
(z", z”) = / y pew ind pein? e 29 rdrdé 
0 0 


2n oo 
= i eras | pitt] 6-26) dp 
0 0 


=0 
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ifn # m. Note that the integral on r converges for any n,m € No by the properties of 
our weight function. Thus the monomials form an orthogonal set. This orthogonal 
set is complete because every entire function has a convergent power series on 
C. Thus, if we choose c, = ET we normalize our set and, hence, the resulting 
sequence {u,,} is an orthonormal basis for Be. oO 

Now we specialize further by choosing the family of weight functions given by 
w(z) = el" fora € R with 0 < a < 1. (Note that in the notation of (13) and 
of Theorem 9 above, we then have a = 1 if a = 1 anda = Oif 0 < a < 1.) We 
will call the resulting Hilbert space H, := B2. In this case, we can actually find the 
constants c, explicitly. 


Theorem 11 ({2]) Jf0 <a <1, then forn € No, we have that 


2 2 
"lz, = Q- eet) p au | 


(04 


Proof Computing the norm in Hy gives 


my 2n oe) 7 
IIz"II, = [z|"e72" dz = 1d0 pntle—2) cn 
a 
Cc 0 0 


For the integral over r, we make the change of variable x = 27%, which changes the 
integral into 


2n+1 
foe) 1 foe) 
rt 1 2 Deed 2(n+1) 
Nett, = 2" f° ((3)") gage = ardent PE terse, 
. 0 2 2an() a 0 


However, the final integral is simply I" (2 (n+ 1)).0 
Thus we can simply take the normalizing constants c, to be the square root of the 
reciprocal of the formula for ||z"||? given above. Namely, 


al ea) 
a a 


for every n € No. Now examining the action of D on a typical basis element u,,(z), 
we see that: Du, = D(c,z") = ne,z"! = a Un: We thus obtain the following 
representation of D: 


Theorem 12 The operator D is isomorphic to a weighted backward shift on Hy 
taking a sequence of coefficients (a,) in P(C), where f(z) = v6 dnttn(2), 
t0 (YnQn41) in P(C), where for n € No, Yn > O and yp is given by y? = 
72 (nt1Pr(2 (+1) 

D(2(n+2)) 
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Proof Using the last calculation and writing f(z) = S729 dnlln, we obtain 


foe) fore) 
NCy (n as len41 
Un-| = ) Tag nie = ) Yn@n+1Un, 
n 


Cn-1 


Df@) = Yo a 


n=0 n=0 n=0 


— (tent 
Cn 


where y,, . It follows, using the previously calculated formula for c,, that 


,_ +P, tpl? _ 53 m+ 1~r(2(+4+ 1) 
a en 22) een r(2(n+2)) 


as desired. 0 

The last fact we will need from [2] is to apply the standard asymptotic for the 
Gamma function to obtain that y, ~ c- n'-z asn > oo, where c is a positive 
constant. Thus if 0 < a < 1, then y, — 0 asn —> oo. 

Continuing beyond the results from [2], we start by calculating the adjoint D*. 


Theorem 13 Given f € Hy, let f = ~~) ant be its expansion in terms of the 
orthonormal basis. The adjoint of D* is isomorphic to a weighted forward shift 
given by the equation D* (dy) = (Yn—14n—1)- 


Proof To calculate D*, write D*f = en bpUn. Since {u,} is an orthonormal 
basis we find the nth, for n > 1, coefficient of D*f: (D*f,u,) = (f,Dun) = 
(f. Yn—1Un—1) = Yn—1Gn—1- Thus we have b, = Yn—1a4,—1 for each n > 1. For bo, 
we calculate (D*f, uo) = (f, Duo) = (f,0) = 0. Thus, D* acts on the sequence of 
coefficients (a,) as a weighted forward shift (a,)  (Yn—14n—1), With the new Oth 
coefficient being 0. O 

Now that we have the adjoint, we can immediately see that D is not self-adjoint as 
D is a backward shift and D* is a forward shift. Moreover, the following calculation 
with f(z) = 1 shows that it is not even normal: Indeed, D*Df = D*0 = 0, 
but, on the other hand, DD*f = D(yoz) = yo. This shows that we cannot apply 
the functional calculus for unbounded normal operators that was used in [20-23]. 
Instead we use the Riesz functional calculus, which is valid for bounded operators 
like D. 

Next, we will use the asymptotic y, ~ c- n'~« to determine which trace ideals 
D will belong to, depending on a. 


Theorem 14 The operator D is compact on Hy for any 0 < a < 1, trace class for 
any0 <a < 5, Hilbert-Schmidt for any 0 < a < z, and, in general, D € J, if 
a< Pre for any p € N, where J, is the trace ideal defined in the previous section. 

Proof Let0 < a < 1, and let Ey : H, — Hy, that takes a power series 
paar AnUy b> sae AnAn—1Un—1, Which is the composition DPy of the derivative 
operator D with the projection onto the subspace of polynomials at most degree 
N, Py. Each Ey is of finite rank, in fact, the range of Ey has dimension N. We 


claim that the norm limit of Ey is D = 4. Note that ||D — Ey|| = sup,.y{An-1}. 
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But 2, ~ cena 0, asn — oo, for any 0 < a < 1. Thus Ey 
converges to D in norm and therefore D is compact. Furthermore, we can write 
D= im 1 An—1 (Un, *)Un—1 SO A , are the singular values of D. To determine when 
Ax) are a , we use the Limit Comparison Test to compare }7?2 | (A*_,)? with 


yy (n'~a a), which converges if and only if p (1 - +) < —1. Solving this ewes 


n=1 


a<5 rea . Therefore D € ke ifa< ea and, in particular, is trace class if p < 5 + and 
Hilbert Schmidt if p< z (Here, we have used the notation of Definition 5.) j 
From now on, we will fix an a withO <a < 5 and simply refer to Hy as H.1n 


this case, we have the following spectrum for D. 


Theorem 15 We have o(D) = 0,(D) = {0} and 0 is a simple eigenvalue of D with 
eigenfunction f(z) = 1, the constant function equal to 1. 


Proof We know from Theorem 9 that o(D) = A,, where a = lim eo) , as in (13). 
Here we have $(t) = t for0 < a < 5. Thus we have that lim ra lim 1°"! = 0. 
too too 


It follows that a = 0 and hence, by miso 9, a(D) = Ao = {0}. However, we 
also know that f(z) = 1 € H, so that D has the eigenvector f corresponding to the 
eigenvalue 0 and the point spectrum of D is also o,(D) = {0}. 0 

Finally, we turn to considering the set of operators {e~°?};<c. We compare this to 
the result for 0, obtained in [23] and mentioned in Sect. 1. This theorem will show 
that D is the infinitesimal shift (of the complex plane). 


Theorem 16 The family {e~°?},<c gives the group of translation operators on H. 


Proof First note that since any f € H is an entire function, we have the convergent 


power series representation: f(z— s) = )°™, f a (—s)" for any z,s € C. Thus 
CO le.) 
e"'F@=)>- “(-sb) "*@Q= >, mo FO = f@-5). 
n= a = 


This shows that e~*? just acts as translation by s on the space H. From this 
expression we also see that lim||e~’f — || = 0. 0 
50 


4 The Construction 


With the results about D in hand, we turn to constructing an operator that might play 
the role of Frobenius when dealing with the Riemann zeta function or other entire 
or meromorphic functions of interest in number theory, analysis, or mathematical 
physics. 
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We begin by considering a particular choice of the family of analytic functions 
o:(z) = z+T. This gives us operators 


D, := ¢,(D) =D +t, (18) 


for which the following lemma holds. (Recall that a has been fixed once for all to 
satisfy 0 <a < 5 and hence, that Theorem 15 applies.) 


Lemma 2 For anyt € C, D, € B(A) with spectrum o(D,) = {t}. If t # 0, then 
D, is invertible and D;' € B(A). 


Proof Applying the functional calculus on bounded operators along with the 
Spectral Mapping Theorem to the operator D and the function ¢,(z) gives a 
bounded operator D, with spectrum o(D,) = ¢,({0}) = {t}, where we have used 
Theorem 15 according to which o(D) = {0}. Furthermore, if 7 4 0, then 0 ¢ o(D,) 
and it follows that D, has a bounded inverse. 0 

This gives us a family of operators, each of whose spectra are each a single point, 
which can be any complex number. Recall that in the situation of the cohomology 
theory that helped prove the Weil conjectures, we would like an operator whose 
eigenvalues on different cohomology spaces are the zeros and poles of the zeta 
function we are interested in. In order to obtain an operator whose spectrum can 
represent the zero or pole set of a meromorphic function, we use the following 
construction. If Z = {z,,2,...} is a (finite or countable) multiset of complex 
numbers, let H,, be a copy of the weighted Bergman space H and associate an 
operator D,, to be D,, on H,,. (Here and thereafter, a multiset is a set with integer 
multiplicities.) Finally, define the Hilbert space Hz = @,, H,, with operator Dz = 
@®,, Dn. This gives 


Theorem 17 For Z = {2z,,2,...}, the operator Dz, constructed above, has 
spectrum o(Dz) = {Z1,2Z,...}. Furthermore, for each i € N, z; is an eigenvalue 
of Dz and the number of linearly independent eigenvectors of z; for Dz in Hz is 
equal to the number of times z; occurs in the multiset Z = {z, 22, ...}- 


Proof For eachn €N, let e, € Hz be the element which is the constant function, 
with value | in the nth component, and 0 in every other component. Then Dze, = 
Zn€n and so Z, is an eigenvalue with eigenvector e,. Suppose Z,, = Zn. = ++: = 
Zy, = z. Then z is an eigenvalue with eigenvectors e€,,, @n),...@,, and so there are at 
least as many linearly independent eigenvectors of z for Dz as the multiplicity of z 
in the multiset. Next, recall that the only eigenvalue of é on H is 0. Thus, the only 
eigenvalue of D,, is z,. Suppose now that Dzx = zx for some x, we must either have 
the nth component of x being 0 or z = z, and so there cannot be any more linearly 
independent eigenvectors of z for Dz. Now that we know z; is an eigenvalue of Dz 
for each i, we know that {z), Z2,...} C o(Dz). Next, let A € C — {z,, 2,...}. Then 
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d = inf,>0 |A — z,| > 0. Since D = 4 is quasinilpotent, r(D) = 0 and so there is a 
positive integer N such that for every integer k > N, we have ||D*|| < a Then, on 


ko, 
the nth component of Hz, we have that )°72.9 Wal is absolutely convergent because 


k 
3 a 3 () _ 
k=N JA = zn] k=N " a 


Then we can calculate the inverse on the nth component via the absolutely 
convergent series: 


co 


1 D* 
AI—D.)7! = ((A-—z)I—-D)7! = 
(AI —Dz,)"! = (Az) =D) era See 


Further, by the same estimate ||(AJ — D.,)~!|| < C uniformly in 1, where 


SIDE jy 

C=), a (19) 
k=0 

Therefore, (AI — D.,)~' € B(Hz) and so (AI — Dz)! exists and is bounded. 

That is, A € p(Dz), the resolvent set of Dz; recall that by definition, o(Dz) is the 

complement of p(Dz) in C. Hence, o(Dz) = {z1,2,...}.0 


Corollary 1 [f Z = {z1,2,...} is either the zero or pole set of a meromorphic 
function f(z), then Z is a discrete set and so we have exactly o(Dz) = Z counting 
multiplicity. Moreover, each z; in Z is an eigenvalue of Dz, with multiplicity equal 
to the multiplicity of z; in the multiset Z. 

Thus, Dz has all of the information from the multiset {z), z2,...} contained 
in its spectrum. If we then consider the multiset to be the zeros and poles 
of a meromorphic function f(z), then the operator Dz contains these pieces of 
information of this function. This was, in fact, the original goal of this direction. 
In [23] (also [20, 21], and [22]) the spectrum of the operators studied there were 
vertical lines in the complex plane and the values of ¢(s) on these lines. This work 
was meant to approach the problem in a similar, but different, way and isolate out 
the zeros and poles of certain meromorphic functions. Theorem 17 gives a positive 
result that we have created such an operator. However, we wanted to go further 
and find a way to use determinant formulas to fully recover all of the values of the 
desired function as was done in the Weil conjectures. Unfortunately, we cannot use 
the determinant formulas for operators given in Sect. 2 for the operator Dz to recover 
f(z) as a whole, because with this formulation Dz is not trace class. Even looking at 
just a single one of the terms, D, = D + tI, with t 4 0, is not compact, let alone 
trace class (or, consequently, in any of the operator ideals J,,), because our Hilbert 
space is infinite-dimensional. (Indeed, in a Banach space, the identity operator, like 
the closed unit ball, is compact if and only if the space is finite-dimensional.) 
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The first such modification we will make is to restrict each D, to its eigenspace, 
E, the space of constant functions. This restriction is then a compact operator. 
We also need to make a second adjustment from the original idea. Any zero or 
pole set of a meromorphic function will be a discrete set, and hence if there are 
infinitely many zeros or poles, they must tend to oo. This would then imply that the 
operator Dz described here is unbounded. What allows us to repair this problem and 
simultaneously recover the given function f(z) using determinants, is to have our 
set Z consist of the reciprocals of the zeros rather than of the zeros themselves, and 
similarly for the poles. (In each case, the multiplicities of the zeros or the poles are 
taken into account.) The set of reciprocals will not necessarily be discrete as if the set 
is infinite, the sequence will tend to 0. However, rather than being a problem, this is 
actually completely required. Indeed, a compact operator on an infinite-dimensional 
space cannot have a bounded inverse and so 0 must be in the spectrum of any 
such compact operator. Combining this observation with the fact that regularized 
determinants apply only to trace ideals of compact operators, we see that having 0 
in the closure of the set of reciprocals is necessary to apply the determinant theory 
to Dz. 

One comment to make about the restriction of the operator Dz to its total 
eigenspace, E7, is that it simplifies the operator to a multiplication operator because 
the derivative on constant functions is just the zero operator. This is unfortunate as 
we do lose some of the rich setting of the Bergman space that has been used thus 
far. We are currently exploring alternative constructions in [5] that would allow us 
to remove this restriction and work on all of H. However, as we will see in the next 
section, the new version of the operator obtained by restriction will retain the desired 
spectrum from Theorem 17. 

In addition, we observe that in some sense, by analogy with what happens for 
curves over finite fields for Weil-type cohomologies and with what is expected 
in more general situations associated with the Riemann zeta function and other 
L-functions (see, e.g., [10, 12] and [25, esp. Appendix B]), Ez (the total eigenspace 
of Dz) is the counterpart in our context of the total cohomology space (or, in 
the terminology of [25, 28], the total “fractal cohomology space”) and corre- 
spondingly, the restriction of the original (generalized) Polya—Hilbert operator Dz 
to its eigenspace Ez is the counterpart of the linear endomorphism induced by 
the Frobenius morphism on the (total) cohomology space. (See Sect. 1.1 above.) 
Therefore, this modification of the original operator seems natural (and perhaps 
unavoidable) in order to obtain a suitable determinant formula, of the type obtained 
in Sects. 4.1 and 5 below. 


4.1 Refining the Operator of a Meromorphic Function 


First, let Z = (z,) be a sequence of complex numbers. Let D, := D + z,I be the 
operator in the previous section restricted to the subspace of constant functions E. 
(It is clear that D,, is normal.) Let Dz = @,, D, act on the space Ez = @,, E which 


Regularized Determinants and Riemann Zeros 53 


is a closed subspace of the Hilbert space Hz from the last section. So in actuality, 
this new definition of Dz is just the restriction to the Hilbert space Ez of the operator 
given in the previous section. First we note that this restriction still retains the main 
property from the last section. 


Theorem 18 For each n €N, each z, is an eigenvalue of Dz and the number of 
linearly independent eigenvectors associated with z, is equal to the number of times 
Z, occurs in the sequence Z. Furthermore, o(Dz) = {Z, :n = 1,2,3,...}. 


Proof Let e, be the eigenfunctions from the previous proof. Then since e, is 
constant in each coordinate, e, € Ez. Thus when restricted to the space of functions 
constant on each coordinate, Dz retains all of its eigenvalues and eigenvectors 
from before. Finally, we note that o(D.,) = {z,} from which it follows that 
o(Dz) = {z, :n = 1,2,3,...} as in the proof of Theorem 17. 0 


Remark I Note that in the case when Z = (z,) is the sequence of the reciprocals of 
the nonzero elements in the zero set or the pole set of a given meromorphic function 
(as in Sect.5 below), then o(Dz) = Z if Z is finite and o(Dz) = Z\_{0} if Z is 
infinite. 

The next theorem shows that this restriction of the operator will truly give us 
what we need for our quantized number theory framework. 


Theorem 19 We have the following relationships between an infinite sequence Z = 
(Zn) and the associated operator Dz. 


1) Dz is bounded iff (z,) is a bounded sequence. 
2) Dz is self-adjoint iff Zz, € R for all n. 
3) Dz is compact iff lim z, = 0. 

noo 


4) Dz is Hilbert-Schmiadt iff \~-~ , |Zn|? < 00. 
5) Dz is trace class iff °°, |Zn| < 00. 
6) For p > 1, Dz € Ip iff Yom, |Znl? < 00. 


If (Zn) is a finite sequence, then Dz is bounded, compact, and in J, for each p = 1. 


Proof Since ||D,|| = |zn|, for each n € N, we have ||Dz|| = sup, |z,|. Then Dz is 
bounded iff (z,,) is a bounded sequence. For 2), consider the sequence of operators 
Dy = ar D,, for N € N, as an operator on Ez by letting it act as multiplication 
by 0 on the remaining components. Thus Dy is a finite rank operator on Ez for each 
N. Then ||Dz — Dy || = sup,.y |ze| and so if lim,—+oo Zn = 0, we then have that Dz 
is the norm limit of finite rank operators and thus is compact. On the other hand, if 
limy-+o0 Zn # 0, then {e,} is a bounded sequence of vectors such that {Dze,} has 
no convergent subsequence. Thus, Dz is not compact. For 3) and 4), assume that Dz 
is compact. Then since Dze, = Z,€, and the fact that {e,} forms an orthonormal 
basis for Ez we know the singular values of Ez are {z*}, which is just the sequence 
of numbers |z,| arranged in nonincreasing order. Thus Ez is Hilbert-Schmidt iff 
yr lenl? < 00, and, trace class iff }°° , |z,| < oo and, more generally, in J, iff 
ye zn|? < oo. Finally, if (z,) is a finite sequence, then Dz is actually a finite rank 
operator and is trivially bounded, compact, and in J, for each p > 1.0 
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Note that the above result is well known from the theory of multiplication 
operators on sequence spaces and is just translated here in our setting. Now that 
we have a formulation that can indeed give us a trace class operator, we can state 
the result we will use to fully recover certain functions of interest. 


Corollary 2 /f {zn} is a sequence of complex numbers satisfying Y~>~, |Zn| < 08, 
then we have det(I — zDz) = [],,(1 — Zn2). 


Proof This is a direct consequence of Eq. (8) for trace class operators of which Dz 
is one when the series is absolutely summable. 0 

By the previous corollary, we can now see that we will be getting an entire 
function out of our construction. Thus if we want to handle meromorphic functions, 
we will need to handle zeros and poles separately. Also, we will want to choose our 
sequence (z,,) to be the reciprocals of the zeros and, separately, of the poles. With 
this in mind, we make the following final construction for our operator associated 
with a meromorphic function. 

First, let f(z) be an entire function on C with z = O not a zero of f. Let 
{an} be a sequence of the zeros of f(z), counting multiplicity. Define the sequence 
Z = (Z,) where z, = oe Define Dz as before and call this Dr. Now given an integer 
m > 1, if we have Dy € Jm \ Jm—1, then det,,(7 — zDz) is well defined, where the 
regularized determinant det,, was defined in Sect. 2. (See, especially, Definition 6 
and Theorem 5.) Finally, we note that if we are dealing with a meromorphic 
function instead of an entire function, we follow the lead from the proof of the Weil 
conjectures to simply take the ratio of these regularized determinants, with possibly 
the operator associated with the numerator being in different trace classes (that is, 
in different operator ideals) than that of the denominator. 

In the next section, we will examine what this construction accomplishes for 
several classically important functions in number theory. 


5 Applications of the Construction 


In this section, we apply our construction to some special meromorphic functions of 
interest and conclude with showing that this construction does indeed give a suitable 
replacement for the Frobenius for any entire function of finite order. 


5.1 Rational Functions 


To begin, we look at the simplest type of meromorphic functions: the rational 
functions. Let f(z) be a rational function. Then, we can write f(z) = zg(z) for 


h S 
some k € Z and further g(z) = 2(0) "2, where h(z) = I] (: - =) and 
k(z) n 


n=1 
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t 
z : 
k(z) = I] (1 a +) for some finite set {a,d2,..., 4s, 1, b2,..., b;}. Construct 
n=1 iG 
the operator Dj) and Dy) as given in the previous chapter. The following theorem 
tells us that the determinant exactly recovers the given function /. 


Theorem 20 /f f(z) is a rational function as above, then 


det, d = zZDniz)) 


ok 
f@ =Z g(0) det; (J — ZDxz)) : 


(20) 


Proof Write out f(z) as given in the preceding paragraph. Then consider the finite 
sequences Z = Cee Peas +} and P = es oie mae The operators Dz and Dp 


are both trivially trace class since both are created from finite sequences. Hence, 

we may apply the 1-regularized determinant (really, just the normal Fredholm 

determinant since both operators are trace class) we defined to obtain 
deti(2— Diy) — Mn=id-%) _ g@) 


deti(7—zDig) Trai - £) (0) 


(21) 


ke Weg) 
and thus, f(z) = z 8(0) 02D," 


We consider Dj,z) to be the analog of Frobenius for h(z) (zeros) whereas Dy,z) 
would be the analog for k(z) (poles). Then this ratio of determinants would be 
considered a graded determinant associated with the Frobenius of the divisor (zeros 
minus poles) of g(z). 


5.2. Zeta Functions of Curves over Finite Fields 


Recall that the zeta function of a (smooth, algebraic) curve Y over the finite field 
CO 


Yn 
F, is defined as ¢y(s) = exp (>: “-") The proof of the Weil conjectures 
n 
n=1 
expressed this function as an alternating product of determinants as follows: 
det(I — F*q~°|H') 
det(I — F*q75|H°) det(I — F*q75|H?) 


by(s) = 


One of the Weil conjectures, that ¢y(s) is a rational function of g~*, then followed 
from this formula. Thus we may apply the result in the previous section about 
rational functions to obtain the following theorem. 

Theorem 21 Let Y be a smooth, projective, geometrically connected curve over Fy, 


the field with q elements. Write Cy(s) = f i with f (z), g(z) both polynomials. Then 


det; (J — q~*Dy) 
det; (I—q7°D,) 


Cy(s) = (22) 


56 T. Cobler and M.L. Lapidus 


Proof We have that ra is a rational function of z. Thus by the rational function 


f@ det U—zDy) 


result: iO. deh = Das) and so replacing with z = gq“ gives: Cy(s) = 
besa) as desired. O 
det; (I-q7"De) i : 


Note that the results in this subsection can be extended in a straightforward 
manner to the zeta function of a (smooth, algebraic) d-dimensional variety over 
F,, where the integer d > | is arbitrary. 


5.3. The Gamma Function 


The next meromorphic function that we will turn our attention to is the Gamma 
Co 


function, defined initially by '(z) = / xe dx for H(z) > 1. This function 


has numerous applications in many Branches of mathematics, including our focus— 
number theory. One point of interest is that this function gives a meromorphic 
continuation to all of C of the factorial function on integers. It also appears in the 
functional equation for the Riemann zeta function. We have the following well- 
known properties of the Gamma function: 


Theorem 22 Forze C, z €¢ {0,—1,—2,...}, we have 
1) P@+1) = zl (2). 
2) (an) = (n—-1)! forneN. 


2 TQ= "Tl (( + sy") 
n=1 


This infinite product representation for I"(z) allows us to now show that we can 
recover the function from the determinant of the operator construction we have 
laid out. 


Theorem 23 We have that for all z € C, 
aie 1 
z dety(I—zDzrw@) 


rEe=< (23) 


Proof We will apply our construction to the function g(z) = ae This function is 
entire and has a simple zero at each negative integer. Note that the residue of I"(z) 
at z = 0 is 1; so that g(0) = 1. Now if we consider the sequence, Z = (—+), 
of reciprocals of zeros of g(z), we see that it is not a summable series but that it 
is square summable. This means the associated operator Dz is not trace class, but 
only Hilbert-Schmidt. This forces us to use det, in our definition of the regularized 


determinant. In fact, 


oe) 


det,(J — Dz) =] [( is ~)en* |. (24) 


n=1 
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This then leads to the following computation: 


a eee -1 
denU=Dare) — deta — zz) 


ll 
—l2 
|_| 
— 
+ 
SIN 
od 
® 
| 


= ze’*I'(z). 


Therefore, we conclude that I"(z) is given by Eq. (23), as desired. O 


5.4 The Riemann Zeta Function 


Next, we turn our attention to another important example, the Riemann zeta 
function. First, we will consider the well-known Euler product expression for ¢(s). 


Theorem 24 Fors € C, with (s) > 1, 


¢s)=[Ja-p°y. 
Pp 


where the product is taken over all prime numbers p. 

To use our formulation, let #(z) = 1—z. Then, by the result for rational functions 
in Sect. 5.1 (Theorem 20), we have that 6(z) = det(J—zDg), which is true for every 
value of z # 1. Letting z = p™ then gives (1 — p‘)~' = (det(I — p““Dg))~! for 


s # 2aik 'k € Z. This leads to the following operator based Euler product: 
logp 


Theorem 25 Fors € C, with R(s) > 1, 


¢(s) = | [(det —p™Dy)) 1, (25) 


Pp 


where the product is taken over the primes p. 


Proof We simply apply the determinant equality to each term in the infinite product 
and then use the standard Euler product convergence. Note that for i(s) > 1, we 
never have s = i a for any integer k; so that the determinant equality does apply at 
each prime p. O 

The completed zeta function, &(s) = im 25(s —- Dr (5) C(s), is an entire 
function whose zeros all lie in the critical strip {s ¢ C : 0 < R(s) < 1} and 
coincide with the critical zeros of ¢(s). We have the following well-known product 
representation for &(s). 
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Theorem 26 Fors € C with X(s) > 1, 


(9) = 5m rietean- DTT | (1 - =) e|, (26) 


p 


where y denotes Euler’s constant and the infinite product over p is taken over all of 
the zeros of &(s), which are the nontrivial (or critical) zeros of €(s). 

Now if we wish to express §(s) by using the determinant comeenon in this 
paper, we need to consider Z = {1} and the convergence of Pane . It is proven in 
[13] that this series converges for p = 1, but only conditionally ad so we will need 
p = 2 to get the absolute convergence needed for Dz € Jz. Thus we must consider 
the determinant det (J — sD¢,s)). This suggests the following theorem. 


Theorem 27 For alls € C, 
1 5 y 
f() = 503 ellos2n)-1-5)s dets(I — sDg). (27) 


Proof From the preceding discussion, we begin by defining Z = ae and 
constructing D¢(s) = Dz. Then we calculate 


det2(/ _ sDé(s)) = det. (I — sDz) 
aI)" 


26 (8) 


e=5 e(log(2x)—1—5)s , 


Thus, &(s) = 5m 3 e(los(2n)—1—5)s det)(J — sDg), as desired. This result is first 
obtained for H(s) > 1, and then upon analytic continuation, for all s € C. Indeed, 
both &(s) and the regularized determinant define entire functions of s. 0 

We can then combine the results for &(s) and I"(s) to give an expression for ¢(s) 
in a similar spirit to the representation of zeta functions of curves over finite fields, 
as follows. 


Theorem 28 For alls € C, 


ellog(2x)—I)s det2(7 — 5Dsr(3)) detz(7 — sD¢) 
2 det, Cd = sDg) 


¢(s) = — , (28) 


where det, (I— 5D r(3)) gives the trivial zeros of §(s), detz(I—sDe) gives the critical 
zeros of zeta, and det, (I — sDg) gives the single pole at s = 1 with ¢(s) := 1—s. 
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Proof We first recall the following three equations (see, in particular, Theorems 23 
and 27), valid for all s € C: 


r (:) _ 2e772 1 
2 AY det2(J — 5Dsr(sy) 


£(s) = 5x fs(s— Dr (5) 668, 
and 
E(s) = s ~3 ellos2n)—1-5)s det, (I — sDe). 


We then solve for ¢(s) in the middle equation and substitute the other two to obtain 
successively: 


212 £(s) 
s(s— )I(5) 
5 3 elos@n)— '—35 deto(I — sDe) 
s(s — 1) (det) (I — §Ds r(sy))7! 
e(log(27)—1)s det. (I _— 5Dsr(s))) det (7 = sD) 
2 (s—1) 


G(s) = 


Finally, letting #(s) = 1 — s and using Theorem 20, since ¢ is a rational function, 
we can replace s — 1 = — det, (J — sDg) and obtain the desired equation. 0 

We will conclude this section with a different approach that gives an equivalent 
criterion for the Riemann Hypothesis. Let Z be the set of zeros of the function 
defined by é (s) =& (5 + is). Construct the operator Dz = D;. This leads to the 
following result. 


Theorem 29 The operator D; is self-adjoint if and only if the Riemann hypothesis 
is true. 


Proof This follows directly from part (2) of Theorem 19 and the fact that the 
Riemann Hypothesis says that the zeros of & (3 + is) must all be real. 0 

It should be stressed that Theorem 29 does not as yet provide an approach to the 
Riemann hypothesis, for some of the reasons outlined in Sect. 6. We also note that 
Theorem 27 is potentially more useful than Theorem 28 (in part because it does not 
involve a determinant associated with the gamma function). 
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5.5  Hadamard’s Factorization Theorem of Entire Functions 


In this section, we observe that the theory presented here is quite general. It will 
apply to all entire functions of finite order. We will begin with an overview of the 
concepts of rank, genus, and order of an entire function as described in [7]. 


Definition 8 Let f be an entire function with zeros {a), a2,...}, repeated according 
to multiplicity and arranged such that |a,| < |a2| <---. Then is said to be of finite 
rank if there is an integer p such that )°°°; |an|~*) < oo. If p is the smallest 
nonnegative integer such that this occurs, then f is said to be of rank p; a function 
with only a finite number of zeros has rank 0. A function is said to be of infinite rank 
if it is not of finite rank. 

In order to define the genus of an entire function, we need to define what it means 
for an entire function to be written in standard form, which will require the following 
definition. 


Definition 9 For n € No, define the elementary factor 


_ pul=2), ifn = 0, 


E,(2) : 2 on : 

(l=gyexp() > 3 betes) ae d. 

To justify the definition of elementary factor, simply note that if }°°°,, |a,|~?T) < 
oo, then the infinite product []™, E,(=) converges uniformly on compact subsets 
of C and defines an entire function with zeros at the complex numbers a,,n > 1. 
The exponential factor is what is needed to ensure convergence of the infinite 
product. With this definition in hand, we can, in turn, define the genus of an entire 
function: 


Definition 10 An entire function has finite genus if the following statements hold: 
1) f has finite rank p and 2) f(z) = z"e8 T°, E, (2) where g(z) is a polynomial 


G 


of degree gq. In this case, the genus of f is defined by 4 = max(p, q). 
We now define the order of an entire function: 


Definition 11 An entire function f is said to be of finite order if there is a positive 
constant a and ro > 0 such that |f(z)| < exp(|z|*) for |z| > ro. If f is not of finite 
order, then f is said to be of infinite order. If f is of finite order, then the number 
A = inff{a: |f(z)| < exp(|z|*) for |z| sufficiently large} is called the order of f. 

Thus the order of an entire function f is a measure of the growth of |f(z)| as 
|zZ] — co whereas the rank of f is based on the growth of the nth smallest zero as 
n — oo. From the definitions, there is no inherent relationship between the two 
concepts, but with the following version of the Hadamard factorization theorem, we 
see that they are in fact closely related: 
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Theorem 30 (Hadamard Factorization Theorem) /f f(z) is an entire function of 
finite order i, thenf has finite genus 4 < 0 and f admits the following factorization: 


f@ =" TE, (=) . (29) 


n=1 ms 


where g(z) is a polynomial of degree q < 4 and p = [A], the integer part of 2. In 
particular, f is of rank not exceeding p. 

Now when we apply our operator construction to a given entire function of finite 
order we obtain a Quantized Hadamard Factorization Theorem. 


Theorem 31 (Quantized Hadamard Factorization Theorem) /f f(z) is an entire 
function of finite order 4, then f admits the following factorization: 


F(2) = 2"e% detps 1 (I — Dy), (30) 
where g(z) is a polynomial of degree q < 4, and p = [A]. 


Proof By the standard Hadamard factorization theorem, we can write 


f@) = 2" TE, (2) . G31) 


n=1 ic 


where g(z) is a polynomial of degree g < A and p = [A], with the rank of f 
not exceeding p. That is, if {a,,a,...} is the multiset of zeros of f(z) including 


multiplicity, then }°P2 , ae < oo. Thus if Z = oe a ...}, the associated 


operator Dz € Jp41. Then we can calculate successively: 


det,41(/ — zDyp) = det,41 (7 — zDz) 


0° Pp j 

z z 
= (: - =) exp — 
an 1 Jn 


f(@ 


zm es (z)* 


Thus we have that f(z) = 7”e8 det,+1(/ — zDy(,), a8 desired. O 

In the above proof, we see that the extra convergence factor in the regularized 
determinants is exactly the same as the one for the elementary factor in the infinite 
product representation of entire functions, which validates, in some sense, the choice 
in this paper for the type of regularized determinants as those based on trace ideals. 
Thus the convergence factors needed in the usual Hadamard Factorization Theorem 
have an interpretation here relating to Tr(D”). 


62 T. Cobler and M.L. Lapidus 
6 Conclusion 


We ended the previous section by giving what we called the Quantized Hadamard 
Factorization Theorem. This showed that the construction given in this paper can 
apply to any entire function of finite order and then, by taking ratios of determinants, 
can be extended to any meromorphic function which is a ratio of two such entire 
functions. This was worked out explicitly for the Riemann zeta function (see 
Theorems 27 and 28 above), and it has also been worked out by the authors for 
zeta functions of self-similar strings, both in the lattice and nonlattice case. (See 
[28, Chaps. 2 and 3] for background on self-similar fractal strings.) However, there 
was nothing in the construction preventing us from applying our results to even 
more general number-theoretic functions. In particular, a natural idea would be to 
try to extend our determinant formulas to other L-functions (see [33]). Could we 
then apply this construction to any zeta function (or, at least, to most zeta functions) 
from arithmetic geometry? This would require, essentially, knowledge about the 
existence of suitable meromorphic extensions of such functions, as well as about the 
asymptotic behavior of the zeros and poles of such extensions. Phrased differently, 
the Z-functions for which our methods could be applied are those which can be 
suitably completed to become entire functions of finite order (or ratios of such entire 
functions). Furthermore, this naturally brings the consideration of the Selberg class 
of functions. See [33] or [25, Appendix E] (and the many references therein) for a 
discussion of these functions. 

Another direction to take is to further justify why using ratios of these deter- 
minants is the correct method for handling meromorphic functions. In [25], the 
second author considers the properties of the Riemann zeta function as related to 
supersymmetric theory in physics and this ratio of determinants can be explained 
as a (regularized) Berezinian determinant from the theory of super linear algebra. 
However, this will not be further discussed here, but could be crucial for expanding 
upon the ideas presented in this work. 

In this paper, we obtained a quantized version of the Hadamard factorization 
theorem, Theorem 31, but we expect to be able to generalize this result to obtain a 
quantized Weierstrass product formula; see [5]. 

With all of the successes obtained here, we must also admit the failures of this 
theory, at least in its present stage of its development. The construction of the 
operator for €(s) explicitly assumed knowledge of the zeros of ¢(s) and thus one 
could never hope to prove (RH) directly with this method. However, if we could find 
a different way to obtain the same function, by comparison you could extrapolate the 
zeros as was done with the Weil conjectures. That is, we need a suitable geometry 
and cohomology theory that would result in the same ratios of determinants of these 
operators. In the Weil conjectures, the geometry or points on the curve (over F,, 
the algebraic closure of F,) corresponded to the fixed points of powers (or iterates) 
of the Frobenius operator. (Recall from our discussion just prior to Sect. 4.1 that in 
our context, the “fractal cohomology space” would seem to be the total eigenspace 
Ez to which we restricted the original generalized Polya—Hilbert operator, viewed 
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as Frobenius acting on an appropriate analog of the underlying “curve.”) Could we 
then consider the fixed points of the operator constructed in this paper? Analytically, 
this can be done by considering a suitable notion of generalized eigenfunctions 
(viewed as generalized tempered distributions). Thus far, however, this idea has not 
led to the development of a suitable working theory for the geometry underlying 
€(s). Providing an appropriate geometric framework is one of our main long-term 
objectives for future research on this subject. 

Another interesting and related question (connected, in particular, to our dis- 
cussion in Sects. 1.1 and 1.3) is whether the still conjectural fractal cohomology 
theory satisfies a suitable analogue of the Lefschetz fixed point formula (as stated in 
Theorem 1) for the counterpart of Frobenius. 

One additional plan that we are currently working on is to rephrase the 
construction we have described here as a cohomology of sheaves in order to properly 
transition from the local setup given in this paper to a more global approach that 
might give new and useful information. 
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The Temptation of the Exceptional Characters 


John B. Friedlander and Henryk Iwaniec 


In celebration of forty years of collaboration 


Abstract We survey some of the history and results related to the topic of the title 
with an emphasis admittedly biased toward our joint works thereon. 


1 Introduction 


Both on our own and together, the authors have been fascinated by the problems 
posed by the possible existence of exceptional Dirichlet characters and the profound 
influence these would play in some of the basic questions of multiplicative number 
theory. We take this opportunity to survey some of the history and results related to 
this topic, with an emphasis admittedly biased toward our joint works thereon. 


2 The Class Number Formula 


Let K be a number field and let D denote the absolute value of its discriminant. 
Recall that the Dedekind zeta-function of K 
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has a simple pole at s = 1, with residue k given in terms of some of the most 
important field constants by 


_ 20+ 7PA(K)R 
o/D 


where h is the (wide) class number, R is the regulator, @ is the number of roots of 
unity, and 7;,72 are the number of real (respectively, complex conjugate pairs of) 
embeddings of K into the complex plane. 

We shall throughout restrict ourselves to quadratic fields, so K = Q(./ y(—1)D) 
where y = Xp is the real primitive Dirichlet character of conductor D. Now, the 
above constants simplify substantially and, by the Dirichlet class number formula, 
the residue is also given by 


K 


LU. x) = Do xan. 


n21 


We shall now denote the class number by /(+D) rather than h(K) and, since the 
class number is the order of a finite group, it follows that L(1, yp) > 0, which we 
recall was a pivotal step in Dirichlet’s proof [4]: 


Theorem 1 Let g be a positive integer and a an integer relatively prime to q. Then 
there are infinitely many prime numbers p with 


p = a(modgq). 


3 The Size of L(1, xp) 


It is important to have as much information as we can about the size of Dirichlet 
L-functions in general, especially at s = 1 and most especially for y = yp. As far 
as upper bounds, we know that 


LU, x) « log D 


and that, under assumption of the Riemann Hypothesis for the L-function in 
question, we have 


LO, x) « loglogD. 


One would like to know more but the above is pretty good when compared to the 
situation for lower bounds where one knows, under the same RH, that 


L(1, x) > (og log D)! 
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but what we know unconditionally is much less. From the class number formula it 
follows quickly that L(1, 7) >> 1//D. Moreover, for even characters one improves 
this by a factor of log D by virtue of the presence of a non-trivial regulator. 


4 The Zero-Free Region 


It is important to have a good zero-free region for L(s, 7), s = o + it, for all x, 
but the real characters give extra problems (closely related to the size of L(1, x)). 
Specifically: 


Theorem 2 (de la Vallée-Poussin, Landau [16]) There exists a constant c > 0 
such that 


Cc 


L(s,x) #0 foro > 1 —- ————__—__ , 
log(D(|t| + 1)) 
with the possible exception of a single real and simple zero B = Bp. 

We call such D, 7, 8 “exceptional (with respect to the constant c)” and Landau 
proved that they are just that! More precisely: 


Theorem 3 (Landau [16]) For any given positive M there exists c = c(M) > 
0 such that any given sequence of conductors D, exceptional with respect to the 
constant c satisfies 


Dn+i > DY 


for all n. 

Note that, in order for there to be any exceptional conductors, there must be 
an infinite family of them (since otherwise, knowing as we do that L(1, x) > 0, 
we could just choose c sufficiently small so as to rectify the finite number of 
obstructions to the inequality). 


5 The Class Number 


Theorem 4 (Hecke, Landau [16]) As D runs through non-exceptional conductors, 
we have 


1 
Ld, ——_ 
(Lx) > log D 
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and hence, for imaginary quadratic fields 


VD 
h(—D —.. 
ee log D 


A breakthrough idea came from Deuring [3]. 


Theorem 5 (Deuring [3], Mordell [19]) Jf the Riemann zeta-function has any non- 
trivial zero off the line 0 = 5 then, for imaginary quadratic fields 


h(—D) > co asD>ow. 


Deuring’s idea was extended by Heilbronn [15] to give: 


Theorem 6 (Heilbronn) /f any Dirichlet L-function has any non-trivial zero off the 
lineo = 5 then, for imaginary quadratic fields 


h(—D) > co asD>ow. 


Combining this last theorem with Hecke’s result, [15] proves a famous conjecture 
of Gauss: 


Corollary 1 (Heilbronn) For imaginary quadratic fields 
h(—D) > co asD>ow. 


Improving the quantitative estimate in Heilbronn’s argument, Chowla became 
the first to deduce 


Theorem 7 (Chowla [2]) There are only a finite number of Euler’s idoneal num- 
bers (discriminants for which there is only one class in each genus). 
Indeed, he shows that the number of classes in the principal genus goes to infinity. 
The first to beat the trivial class number bound by a fixed power was (again) 
Landau. 


Theorem 8 (Landau [17]) We have 
h(—D) > D*/(logD)* . 


Siegel sharpened that result to give what we know today, namely: 


Theorem 9 (Siegel [20]) For every ¢ and some c; > 0, we have 
Ld,7)>c.D™~. 
Corollary 2 We have 


h(—D) > D2~*. 
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Although, Siegel’s paper does not mention zeros of L- functions, the theorem 
incidentally returns attention from zeros off the line to zeros near s = 1 since, by 
the mean-value theorem of differential calculus, one has the following result. 


Corollary 3 For every ¢ and some c, > 0, we have L(o, x) # 0 for 


o>1-c,D~*. 


6 Primes in Arithmetic Progressions 


Shortly after the proofs of the prime number theorem by Hadamard and de la 
Vallée-Poussin, the latter combined his techniques with those of Dirichlet to give 
an asymptotic formula for the number of primes up to a given point, in an arithmetic 
progression. We take g=1, (a, q) = 1, and then have 


u(x; q,a) = > 1l~ a) 


pSx 
p=a( mod q) 


and are interested in the range of uniformity in g for which this asymptotic formula 
can be shown to hold. There are three ranges for which this question is particularly 
noteworthy: 


q K- xlcé ; 


which is expected and would be a dream come true; 


q<s x2, 


which follows from the assumption of RH for all Z-functions with characters of 
modulus q: 


q <v (log x)” 
which is known to hold for arbitrary N (the Siegel—Walfisz theorem). 
An important related question is that of giving good upper bounds for pe = 
pe(q,a), the least prime in the arithmetic progression in question. As immediate 
consequence of the above three ranges for the asymptotic formula one has 


pe«q'it®, pe«a@t*®, pe«exp(q'/”). 


It was an achievement of the highest order when Linnik [18] gave an uncondi- 
tional proof of a bound with a fixed power. 
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Theorem 10 Let g=2 and (a, q) = 1. There exists a positive universal constant L 
such that 


pe(q.a) <q’. 


Many improvements have ensued in the known admissible values of “Linnik’s 
constant” L. 

It seems inevitable that separate proofs are required for the two different cases, 
depending on the existence or non-existence of exceptional zeros/characters. At first 
it seemed that the arguments in the exceptional case were the more difficult but more 
recently it has emerged that sometimes they can be simpler and stronger. 

Thus, there is the following theorem of Heath-Brown [14]. 


Theorem 11 Let ¢ > 0. Then, there exists c(e) such that if there is a zero B>1 — 
c(e)/ log q then 


pega) Kg. 


What about other problems? 


7 Twin Primes 


Linnik’s theorem offers an important example wherein the complete result can be 
proven, that is with or without the assumption of exceptional characters. It turns 
out, however, that there are very striking statements which can (up to the present) 
be proven only under the assumption of exceptional characters while proofs in the 
non-exceptional case remain a (probably remote) goal. 

An example of this phenomenon was given by the following surprising theorem 
of Heath-Brown [13]. 


Theorem 12 One (at least) of the following two alternatives must hold. 


1) There exists a positive constant c such that, for every q and every y mod q there 
are no real zeros B of L(s, x) with 


B>1 


~ logq’ 


or 
2) there are infinitely many prime pairs p, p + 2. 


Remark I Recently, in [12] we have given a rather different account of this theorem. 
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8 More Primes in Arithmetic Progressions 


For the remainder of the paper we are going to survey some of our joint works on 
the consequences of the assumption of the existence of exceptional characters. Here 
is a special case of our first work on this topic [7]. 


Theorem 13 Let (a,q) = 1, r = 554, 401, and x = xp. For, 
D'<q< exp(L(I, peers) 


we have 


pe(qay<« gl. 


Actually, we get the asymptotic formula in this range. It is remarkable that one gets 
below the GRH exponent 2, but there are two important cautions. 


1. The result only holds for intervals of g restricted in terms of D. 
2. The interval is non-empty only under the “exceptional” assumption that 


L(1, xo) & (logD)"" . 


9 Primes in Short Intervals 


In [8] we proved the following result which, for short intervals, breaks the analogous 
Riemann Hypothesis barrier, albeit under constraints similar to those in the previous 
section. 


Theorem 14 Let y = Xp, x=D' with r = 18,290 and 


Then 
‘dt 


ma) —ax—y) = f 
xy 


- ar + O(L(, i(logx)”)} 


Corollary 4 /f 
LO, x) « dogxy 


then 


n(x) — a(x—y) =i: “+o 


xy logt 
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It may seem strange to see the assumed upper bound for L(1, x) phrased in terms 
of log x rather than log D but, of course, these are of the same order of magnitude 
when x is about a fixed power of D. 


10 Prime Values of Polynomials 


Recall that unconditionally we know [6] that there are infinitely many primes of the 
form a* + b*. We hoped but did not succeed to prove that under the assumption of 
exceptional characters one might deduce an infinitude of primes of the form a? + 1. 
With a lot more work than in our other theorems of this type, we made a little 
progress. We state a couple of consequences of the results in [9]. 


Corollary 5 Let n = { V1 — 1%dt. Suppose that 
L(1, xp)<(log D)~™ . 
Then, for 
exp((log D)'®) < x < exp((log D)'®) 
we have 
2 6 A. of I 
Yo Ale +69) = =? {1 + 0( —}} 
; 4 logx 
a+b6<x 
So, if there are an infinity of such D, then there are an infinity of primes of the form 
a +b. 
Under a very similar assumption, one deduces the infinitude of elliptic curves 


over the rationals having prime discriminants, hence only a single prime of bad 
reduction. 


11 Squares Plus Primes 


Let D — oo through fundamental discriminants (so, real quadratic fields). Let 
WV (D) denote the number of solutions of 


D=m +p. 


We write p(p) = 1 + yp(p) and note that this is just the number of solutions of the 
congruence b? = D(modp). 
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One conjectures that. (D) ~ H/D/ log D where 


— _ eN (?) ( = -) 
H= 5 (1 ) 1 
Let us define 


vor-T1(-"2) 


p<D 
and 


£(D) = L(1, yp) logD . 
Theorem 15 (see [11]) JfD > 0 and £(D) < 1, then 


N(D) = VDV(D) 1+0 (ex (-5vieei7«~)) )} 


Thus, we see: 


Corollary 6 i) If €(D) is less than a sufficiently small positive constant c, then 
N =x JDV(D) 

ii) If £(D) > 0 as D + on, then 
N ~ JDV(D). 


Note that by Mertens’ theorem we have V(D) ~ e-’H/logD, so the last 
corollary is presumably giving us an answer that is wrong by the factor e ”. This 
is reminiscent of the wrong factor spit out by the sieve of Eratosthenes in a first 
attempt at counting primes (and for essentially the same reason). 


12 Some Ideas About the Proofs 


For a glimpse of the main idea involved one can argue as follows. Let x be real and 
let us take s = 1 + € so that we have the Euler product 


Ls.) =] ](l- x@)p’) 


p 


and 


log L(s, x) = D¢ xP)p* + O-(1) . 
Pp 
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Now, L(s, x) is small if and only if log L(s, x) is large and negative and this can 
happen only if we have y(p) = —1 for most small primes p. Thus, on square- 
free integers x should behave like the M6bius function jz. This reminds us of the 
sieve. But y is periodic and, if we are looking near x which is huge compared 
to D, then the period is small, almost fixed, so our “sieve” is supercharged with 
very special (indeed, illusory) properties. On one hand, the character’s periodicity 
enables harmonic analysis, on the other hand, its Mébius-like feature detects primes. 
The combination of these features brings tastes of the Hilbert—Polya dream. 

In the next two sections, we shall sketch two different ways this simple idea can 
be utilized. 


13. The Exceptional Conductor Influencing Itself 


In this section we sketch the proof of the last stated theorem, that is we consider the 
equation D = m* + p. 
We begin with the finite sequence 


A:n=D—-m 0<m<VD 
and wish to estimate the sifted sum 
Si@jxy= Do 1 
ned 
(n,P(x))=1 

with x = /D where P(z) denotes the product of all primes p < z. We have 

d 

Ag= >) 1= PO 5 + o(p(d)) 
d 
new 
n=0( mod d) 


where p(d) denotes the number of solutions of the congruence b? = D(modd). 
For square-free d, 


p(d)SA(d) = 1 * x(a) 


where 


Yl AM@d™ = E(s)L(s, xv) = bK(s) . 


d21 


Note that A(d) = 0 if there is any prime p | d with y(p) = —1 so that, if x is 
exceptional then usually p(d) = A(d) = 0. 
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We next recall that by the fundamental lemma of sieve theory, we have 


S(@f,2z) =xV(Q){1 + O(€*)} + 8D Ald). 


dy 


Here, the parameter y may be chosen arbitrarily subject to y=z, 


logy A(p) 
list, s= qo and VO = [l= ): 


p<z 


This gives a satisfactory count for the number of integers surviving the sieve up to a 
small range z but leaves a lot more to be done. 
To proceed further we apply the Buchstab identity 


S(o,z) = S(@f,x) + D> S(hy,p) 


zZSp<x 
where &, = {£; {pe ah 
We need to bound the contribution from the sum on the right. 
1. For primes p not awfully large, say those with z<p<z~*x, we can apply an upper 
bound sieve. Although we are going to behave as though this sieve problem is 
usually of dimension zero, all we know for a fact is that its dimension is no 


larger than two. For these small primes this is good enough and applying a two- 
dimensional sieve we obtain 


S(Ay, p)<S(Ap, 2) K Mp) “V@) 


2. For the remaining primes, those with z~7x < p<x, we use the trivial bound 


SA. PS\G SAP) + AP}2A(P) 


In the normal course of affairs, this bound would ruin things, but not in our 
exceptional situation where we expect the sets .%, to be very small and A(p) to 
be zero for most p. 

Combining the above bounds, we obtain 


S(A,z) — S(A, x) K 8(z)V(z)x + 8(z7x)x 


where 


ips ee. 


z<p<D 1 
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Finally, we want to bound the function 6 in terms of L(1, x). It is convenient to use 
instead the truncated L-function 


a on TT OV"! 
roo=TT0 =) 


which is not so different and, in any case, satisfies the trivial bounds 
LU, x)(og DD)! « L*(1, x) K€ LU, x) logD. 


One can prove the following bound: 


Proposition 1 For 0 < a<1/17 we have 
6(D*) « a7! L* (1, x)%/? . 


All of these bounds are unconditional but we can deduce that L(1, 7) small 
implies L*(1, y) small which in turn implies that A(p) is small on average. 
To quickly sketch the above proposition: 


Step 1: By the Dirichlet hyperbola trick 


Yo A(n) = xL(1, x) + error. 


nx 


Step 2: By partial summation, deduce that 


se A(n)n—! = LC, y) log(x/y) + error . 


y<ndx 


Step 3: By the sieve, 


1 
De Ayn! <« I] (1 - -\(1 - EN iG, x) log(x/y) + error . 
y<n&x,(n,P(z))=1 p<z P P 
The choices of x = D, y as a convenient small power of D and z as a still 


considerably smaller power of D yield the proposition. 


14 Influence on the Nearby Conductors 


In this section we consider a somewhat different line of reasoning whereby the 
exceptional conductor casts an effect, not on itself but on other conductors fairly 
far, but not too far, away in magnitude. This is the more common type of application 
of the principle and is already there in Landau’s proof of the rarity of these objects. 
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We recall Linnik’s combinatorial identity for primes. For n=2, we have 


ae 
=o. ”) h(n) 


k 
k21 


A(n) 
logn 


where ¢,(n) is the number of ways of writing n as the product of k factors, each 
greater than one. After some technical difficulties (involving a preliminary sieving), 
this reduces the study of sums over primes 


A(n) 
s An IEA to sums > AnT,(n) 


nSx 


for the ordinary divisor functions r = 1, 2,.... 

With this framework in mind, it makes sense to introduce some simpler iden- 
tities which will operate well within our illusory backdrop. We have fx(s) = 
ene) A(n)n™ so that 


gay = Deon = T1(1 = 5)(0- 2) 


S S 
m Dp P P 


where |v(m)|<A(m)<t(m). Writing 


6’(s) _ Ls, E'(s) 
G(s) ox(s) 


we infer that A = x * log*xv = A’ x v, say. 

Here A’ = x «log and so, if x has a “comparatively small” period, then A’ should 
behave like the ordinary divisor function tT. 

Using this simple identity, we decompose the von Mangoldt function as 


A(n) = A*(n) + Ay(n) = (x + >) S> A@v(m) . 


m<D = m>D/ dm=n 


Here, A* is easy to handle since the problematic factor v, which hides the 
Mobius function, occurs with small m and 4’ behaves like t. On the other hand, 
A, contributes to the error term in what appears at first glance to be unmanageable 
form. Specifically, if we use the simple bound 


|Ax(m)|< > AAC) , 
dm=n 
then this behaves like the divisor function rt? or t4 and, for almost any sequence 
one can name, we have pretty limited knowledge of its distribution. One more 
elementary reduction is needed, a special case of Corollary 22.11 of [10]. 
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Lemma 1 Let r>2, A = x!/", d<x. Then 


t(d)< ) “(2r(5))" 


S|d 
8<A 


with some A < rlogr. 
Inputting the lemma, we see that, for n < x, 


|A(n)|<(logx) Y°(2x(8))" D2 Atm). 


5<A 5Lm=n 


Now, for each fixed 5 we have replaced A’ * A by 1 * A which is a function of 
type 73. In this way, the problem is reduced (of course, modulo technical details) to 
having to require knowledge of }°,,<, dnt (n) only for k = 1,2, 3. 

We have sufficient knowledge of these arithmetic functions to sometimes go 
beyond the square-root barrier, that is g > x'/?+" in the case of arithmetic 
progressions due to [5], and y < x!/2~” in the case of short intervals, already much 
earlier due to Atkinson [1]. One may say that the crossing of this barrier is due to 
the power of harmonic analysis on GL(3). 
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Arthur’s Truncated Eisenstein Series for 
SL(2, Z) and the Riemann Zeta Function: 
A Survey 


Dorian Goldfeld 


Abstract Eisenstein series are not in £?. Maass, and later Selberg, naively trun- 
cated the constant term of Eisenstein series (for symmetric spaces of rank 1) so that 
the resulting truncated Eisenstein series was square integrable. This procedure was 
generalized to Eisenstein series on higher rank groups by Langlands and Arthur. In 
this survey we focus on the deep connections between Eisenstein series for SL(2, Z), 
truncation, and the Riemann zeta function. Applications to zero free regions for the 
Riemann zeta function and automorphic L-functions are elucidated. 


1 Introduction 


Let h := {x+ iy | x € R,y > 0} denote the upper half plane. Then, as is well 
known, the modular group T = SL(2, Z) acts on h as follows. For y = (45) €T 
and z € h, the action of y on zis given by yz := (az + b)/(cz +d). 

The Hilbert space L? (T\h) of smooth L* automorphic functions f : h > C 
satisfying 


(S22) = f (2), for all z € h, (: z) eT, 


with Petersson inner product 
— dxdy 
(f, 8) =| Fays@ —2 3 figel (T\b), 
r\b y 
has been extensively studied. Here f € £L* means 


dxd 
i, Fo + iP? 22 <0, 
T\b BA 
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where I"\ can be taken as the region F := {z € § | |z| = 1, |9(z)| < 1/2} with 
congruent boundary points symmetric with respect to the imaginary axis. 

One of the first examples of automorphic functions discovered is the Eisenstein 
series. For a complex variable s with Ni(s) > 1 and z = x + iy € 5, the Eisenstein 
series for the modular group = SL(2, Z) is defined to be 


S 


1 y 
E(z, 8) := F 
G9) 2 DS: lez + d|?s 


cadé 
(c,d)=1 


It is not hard to show that (see [9]) 


az+b ab 
E 5) = E(z,s), f eT. 
e= s) (z, 8) or (: ’) 


In this survey, we will examine the intimate connection between Eisenstein series 
and the Riemann zeta function ¢(s), which is already apparent in the well-known 
Fourier expansion [9]: 


2n' /F 


T(s)€(2s) > O1-24(n)|n|2K,_ (2x |nly)e"™, 


n#0 
(1) 


E(z,s)=y + o(s)y'* + 


where 


_ I (s— 3) (28-1) _ : 
o(s) = Va TEs) ae 


d>1 


1% _»y - d 
KQ)= 5 [HOw &. 


u 


Here K,(y) is the modified Bessel function of the second kind. 
Consider the completed Eisenstein series 


E*(z, 8) := O*(2s)E(z, 5), 


where ¢*(s) = m~2T (s/2)¢(s) = f*(1 — 8). Then E*(z,s) has the Fourier 
expansion 


2 * s * —s sl qinx 
E*(z,8) = *(2s)y" + 6*(2—2s)y!* + 2/¥ YD o1-a5(n) [nl 2K,_ 1 (2e|nly)e? 
n#0 


from which one easily deduces that 


E*(z,s) = E*(z,1—s) 
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since K,(y) = K_;(y) for any s € C and y > 0. The analytic continuation of the 
Riemann zeta function then gives the meromorphic continuation of the Eisenstein 
series to the whole complex plane. 

The modified Riemann hypothesis for a function Z(s) asserts that all its zeros are 
either on the line t(s) = 1/2 or on the real axis in the interval (0, 1). Hejhal [11] 
proved, in 1990, that the constant term 


c*(2s)y* + 6*(2—2s)y! 


of the completed Eisenstein series satisfies the modified Riemann hypothesis if 
y = 1. His proof used the Maass-Selberg relation. This result was later extended 
in [13]. 

For fixed s € C, the Bessel function K;(y) has exponential decay in y for y > oo. 
This implies that for large y > 0, we have E(z,s) ~ y’ + (s)y!*. Consequently 


1 
s _,j2 axdy 2 Pues _,j2 axdy 
fw + oor? So ff be + best? SP = 00 
r\b y x=—} Jy=0 y 


for any value of s € C. It follows that the Eisenstein series is not in £7([\h). To 
overcome the fact that E(z, s) is not in £? and facilitate the computation of integrals 
involving Eisenstein series, Maass [16], in 1949, introduced the naive truncated 
Eisenstein series E’(z, s). Let T > 0. Then for z € h, s € C, 


E(z, s) ify <T, 


ET (z,s):= 
E(z,s)-y—(s)y* ify>T. 


This function is not continuous in z but will lie in £7. It is also not automorphic, i.e., 


az+b 
C (< +d :) eee 


for most (: ,) eT. 
cd 


In 1962, Selberg [19] extended Maass’ work on naive truncated Eisenstein series 
to the more general situation of symmetric spaces of rank 1. The inner product 
of two truncated Eisenstein series (Maass—Selberg relations), which was based on 
Green’s theorem, played a crucial role in the computation of the Selberg trace 
formula [18]. The more general Arthur—Selberg trace formula (see [15]) has been 
central for the advancement of modern number theory. 

One of the most basic tools in the theory of automorphic forms on higher rank 
groups is the truncation operator which was first introduced by Langlands [14] 
in 1966. In 1978-1980 Arthur [1, 2] made an important advance by extending 
and making much more explicit Langlands’ truncation operator, especially when 
applied to Eisenstein series. In particular, Arthur’s truncation involves a summation 
> -(-1t™M €- for a certain characteristic function Er over the faces F of a 
certain convex polyhedron in a Euclidean space V. 
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The key property we want to point out for this survey is that Arthur’s truncated 
Eisenstein series are automorphic which was not the case with Maass and Selberg’s 
naively truncated Eisenstein series. Arthur’s truncated Eisenstein series greatly sim- 
plified the computation of the Maass—Selberg relations, especially in the situation 
of higher rank groups. The use of differential operators (via Green’s theorem) was 
avoided and all proofs of the Maass—Selberg relations were reduced to algebraic 
computations as in the Rankin—Selberg method [9]. 

In this survey we focus on Arthur’s truncated Eisenstein series for the modular 
group I and its connections with the Riemann zeta function. The reader may find it 
useful to concurrently look at other surveys on truncation such as [4—6]. 


2 Arthur’s Truncated Eisenstein Series 


In order to define Arthur’s truncated Eisenstein series for T we require some 
preliminary definitions. A function F : h — C is said to be periodic if F(z + 1) = 
F(z) for all z € b. 


Definition (Constant Term) For a piecewise continuous periodic function 
F:6 > C define 


1 
CF(z) := i F(z+u) du, (for z € 6), 


to be the constant term of F’. 


Definition (Truncated Constant Term) Let T > 0 and z = x + ty € 6. For an 
integrable periodic function F' : h — C define 


0 if0<y<T, 


C'F(z) := 
CF(z) ify> T, 


to be the truncated constant term of F. 


Definition (Arthur’s Truncation Operator) Let T > 0 and z € 6. For an 
integrable periodic function F' : h — C define 


ATF) = FQ - YD) CF(y2, 
y€Too\P 


to be the truncation of the function F(z) with respect to T. Here 


is the stabilizer of oo under the action of I’. 
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Definition (Polynomial Growth) A periodic function ¢ : h > C has polynomial 
growth if there exist constants C, B > 0 such that 


I6@| < CIm@)’*, — (Im(z) > ov). 


The following theorem summarizes the main properties of the truncation operator 
a, 


Theorem 2.1 (Properties of A’) Let T > 0. The truncation operator A’ satisfies 
the following properties. 


* Let dp € L* (T\b) be a cusp form (the constant term of ¢ is zero) then A'd = @. 

« Let @ : T\hb — C be integrable. Assume T > 1 and z € F. Then 
ATO(Z) = @ — C$(2). 

¢ Assume ¢ : T\b > C is smooth, is an eigenfunction of the invariant Laplacian, 
and has polynomial growth. Then A" € L?(T\b). 

+ Letn,p ¢ L*(T\bh). Then (An, 6) = (n, A"?). 


Proof of Theorem 2.1 Itis clear that if @ is a cusp form then truncation has no effect 
since the constant term of ¢ is zero. Next, assume z lies in the fundamental domain 
F. Then for y € ['/Too, with y ¢ Too, the complex number yz must lie outside of 
F and not in a translate m + F with m € Z. It is easy to see that in this situation, 
we must have 


Im(yz) < 1. 
The second bulleted item immediately follows from this. The third bulleted item is 
much more difficult to prove, so we do not give the details. The last bulleted item is 
easily proved with a change of variables z+ y~!z for y € T/T. oO 


We now carefully examine the action of the truncation operator on the Eisenstein 
series E(z, s) with Fourier expansion given in (1). Then 


CEG, 8) =y' + o(s)y"™, 
and for T > 0, 
CTE(z, 8) = (y+ Pl(s)y'™) - chat. 600), 
where 


1 ifa<y<B, 
char, (y) = 


0 otherwise. 
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It immediately follows that for 7 > 0, Arthur’s truncated Eisenstein series 
ATE(z, s) is given by 


ATE(,3) =E@s) — D> (Im(y9' + 66)ImG"”). (2) 
vy €Too\P 
Im(yz)=>T 


3 Fourier Expansion of Arthur’s Truncated Eisenstein Series 


We now compute the Fourier expansion of A’E(z,s). The proof of the Fourier 
expansion requires the following lemma. 


Lemma 3.1 Let v € C with R(v) > 0. Then 


2+i , (logY)" = 
ae joo pw _ \répy fly, 
2ii Jamiog wi t® 0 if0<Y<1. 


Proof of Lemma 3.1 For St(w) > 0 and t(v) > 0 we have 


—v __ 1 i. —wx .v dx 
w= Tw Jy ex x" 
It follows that 


1 2+i00 yv 1 2+i00 yw 1 lo) Zz dx 
— —— dw = — —-: e'* x" —] dw 
2n1 2—i00 witv 2ni 2—i00 Ww T'(v) 0 X 


1 ia rae | (a Y\" dw dx 
= <i XxX ._— —< — = 
T(v) 0 2ni 2~i00 ex w xX 


eel io , dx — (log Y)” 
~ To)Jo ~ x  FO+d)’ 


It immediately follows from (2) and Lemma 3.1 that 
ATE(z,s) = E(z,s) — E'(z,s) — @(s)E7(z, 1-5), 


where we have defined 


¢ 1 2+i00 T-” 
E* (z, s) := lim mal —— E(z,s+w) dw. 
(z, 5) i. arc ( ) 


Warning: The function E’ (z, s) is not the naive truncation of the Eisenstein series 
that we considered earlier in the introduction. 
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Theorem 3.2 (Fourier Expansion of E’) Let T > 0, and x + iy € 6. Let u(n) 
denote the Mobius function. Then 


1 
[ ET (x + iy, s)e2™™ dx 
0 


charg yr) 2+i00 i 


y- chatyy. oo] (y) + Iai — Joi00 7 Os +w) yr “Wdw ifn=0, 
0 if yT > landn £0, 
| -1 
y Rate 
Is d'—s wo - cos(2ny) dt otherwise. 
re fy (Hy 
yTd2l? <1 
d>1 


Theorem 3.3 (Constant Term of A’E(z,s)) Let T > 0 andz=x-+ ty € b. Then 
the constant term of A’ E(z,s) is given by 


1 
/ ATE(z +u,s) du= (y* 7 $(s)y'*) : (1 _ chatir oo) (y)) 
_ chat, 1/T| (y): ni 


2+i00 T—w 
3 / — (gis + w)yl A o(s)\@(1 —st+ wy") dun 
2: Ww 


—i0o 


Proof of Theorem 3.2 With the Fourier expansion (1), we compute 


1 
[ ET (x + iy, s)eo™ dx 
0 


2+ico w a =——=w 
jim Srila oe (v° + b(s + wy! =) dw n=0, 


2+ico 7- amt a 
oat 2—ioo ma ea Yor 25-20 (a 2 Kepyod (2z|nly) dwn cal 0. 
n 


jim 
=) 


We evaluate the above integrals individually. It follows from Lemma 3.1 that 


1 2+i00 Tw 4 ; 
: SW = 
eb Dari he ara (Y "+ 66+ wy") dv 


1 2+ic0o pP—w 
y+ chat, .(y) + (= / ere a +w)ys™ aw) hates 170). 
2 


220i —ioo 


II 
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Note that if yT > 1, then the integral on the right above is easily seen to be zero by 
shifting the line of integration to the right. This is the reason why we can multiply 
the integral by chat, ,/;(y). This establishes the first case of Theorem 3.2. 

Next, we consider the integral involving the K-Bessel function. To simplify the 
computation we make use of the following integral representation: 


T(s+w) (4x|\n|y)"r"— 2 


Qzlnly) = - [oa 
kK, w—4 en = stw 
‘: Ja 0 (0? + 42?n?y?) = 


It follows that 

i 1 [~ Tw 2a Jy 

im — 

v>0 Qi Jo-jogo wit? P(s + w)f(2s + 2w) 
(2z|n|y) dw 


il 
O71-— 2s—2w(n)|n|Sr™ 2 


Kyl 


2+i00 w 42s+2w_2s+2w—-1l,stw 
1 i. T 2 IU y » qi-2s—2w [njrer—| 


= ji oe 
= 90 27 Jo-iog =W'It? €(2s + 2w) 


= cos(t) 
tw at dw 
0 (P+ 47?n?y?) 


oo 

£ t 

= lim 2(22|n|)5— ly’ Le ) p ae af cos(f) 
v0 = £2s Ti (? + 42n?y 2)° 


1 fo 4n2n2y Ye P 
2ni Joico TV-d?(? + 42n?y?) witv ~~ 


Now, 


; 1 i 4n2n2y v aus 

im —— 

v0 27 Join \ Tl?d?(t? + 4?n?y?) J wit 
4m? ny 1 


1 if ——W—_ > 
= T2d2(12+4n2n?y2) 


0 otherwise. 


The condition 


2n?ny 
>1 
T 02d? (2 + 422n2y?) 
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is equivalent to 


1 
Tea <1 d 0<f <4r’r’y’ —1). 
y ss an = S40 n'y TOR 


It follows that 


jail 
O1—2s—2w(n)|n|st" 2 


i 1 i. Tw a Sy 
im — 
v0 Qi Jo-ing wt? D(s + w)E (2s + 2w) 


K pnt (2z|n|y) dw 


Ss 


—lls ~ s— € —2s 
= 2(2n) ly > ~ |n|? 1 ea dy 23 


2s 
d\n ¢€=1 
yTl2d2<1 
2alnly J seage cos(t) 
lL ay 
— yI>8 > - gis wl) fv nee! cos(2rnyt) 7 
= 2y a |, @an * 
din €=1 
yTd2l2 <1 


If n # O and yT > 1 it is not possible to have integers d,£ > 1 such 
that yTd?¢* < 1. Therefore the above integral vanishes and the second case of 
Theorem 3.2 has been proved. The third case of Theorem 3.2 follows immediately 
from the above. oO 


Remarks In [2], Arthur defined truncated Eisenstein series in great generality. 
The explicit computation of the Fourier expansion of Arthur’s truncated Eisenstein 
series for higher rank groups, such as SL(n, Z) with n > 2, seems out of reach at 
present. This is not the case for SL(2, Z) because of the existence of so many special 
known integral representations for Bessel functions. Our knowledge of higher rank 
Whittaker functions is much more limited at present. 


4 Maass-Selberg Relation for SL(2, Z) 


In this section we compute the inner product of two truncated Eisenstein series 
which is called the Maass—Selberg relation. Arthur’s proof of the Maass—Selberg 
relation just uses the Rankin—Selberg method [9]. It is an algebraic proof based on 
the fact that if a group G acts on a topological space X then the union of all translates 
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g - (G\X) (with g € G) is just the space X. More generally for a subgroup H < G, 
we also have 


U g-(G\X) = H\X. 
g€H\G 


Arthur’s algebraic proof stands in stark contrast to the original analytic proof of 
Maass and Selberg which required differential operators and Green’s theorem. 


Theorem 4.1 (Maass—Selberg Relation) Let s,s’ € C and T > 1. Then 


s+s/—1 s—s! s/s T- s—s? 


sq THOS HO -+$(9) 60) = 
—1 S—S 


(ATEG, 8), E(x, s)\= 


Proof of Theorem 4.1 It is clear that 


(A7E(«, ey: AT(E(*, 5’) 


)S aay (3) 


- | ATE(z, 5) - (EG s) — E(z,5)-d( ET 1—s) 
r\b 


To simplify the computation of (3) we require a lemma. 
Lemma 4.2. We have (ATC, 5), AT(E(«, s)) = (ATC, 5), (E(x, 5"). 


Proof of Lemma 4.2 By (3), it is enough to prove that 


| ATE(z,8)- (FG 8) + b(s)ET(z, 1 <9) dxdy 2% 
I\b 


This can be shown as follows: 


| ATE(z,s) + (EF s') + b(s)ET(z, 1 — a) dxdy 
\b 


dxd: 
> (( Im(y2)"" + #(s’)Im(yz)'~’) - chat. 5 cim(y2)) a 
yeToo\l 
= | ATE(z, (> ys + (s')y!- s! - chat, = o) dxdy 
y€loo do y\b) 


= Il. ATE(z, )( y+ o(s)y'") «chatty oy ow 


= ff t8.9( 9% + FR) cha, 09S 
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This procedure is termed: “unravelling E™(z,s) + 6(s)E"(z,1—s)”. The 
x-integral above picks off the constant term of A’E(z,s) which is given in 
Theorem 3.3 and equals 


(y* F o(s)y'*) : (1 = chatin x) (y)) a chat, 1/T| (y)- Feri 


2+i00 TP—w 
‘ / T (os + w)yls-¥ + o(s)o(1 —st+ wy") ig: 
2 Ww 


—i0o 
Since 
(1 — Chatir co} (y)) . chati. x) (y) = chat, 1/7] (y): chat. 5) (vy) = 0 
for T => 1 the proof of the lemma follows immediately. Oo 


To complete the proof of the Maass—Selberg relation we apply Lemma 4.2 and 
unravel A’ E(z, s). We assume for the moment that (s+ 5’) > 1 and R(s) > R(s’). 
It follows that 


(ATBC*,5), AT(EC*.s")) = (AEC. 5), (E(,s!)) 


[ [- [y —(y + G(s)y"™) chatiy coy 69] -EG@s’) < 


[- [y= 6 + Gy!) ehaty.. 0] (y+ 8") s 


ae ai a ae dy 
is o 7)yl—s T= patty 
[ y (v + b(s)y ) (1 — chat; 0)) 2 


7 fw H(syyr™ Ca + Gy") Chat coy (y) 7 


=| by =t) yar fo $1) (y Coa 1+ Gy") d ly 
pets" —1 : TS ; Ts" -s ; — Tis 
sty—1 We) =F : Oley as are 


This identity then extends by analytic continuation in s, s’ to all s,s’ € C. 


5 A Zero Free Region for the Riemann Zeta Function 


It was proved independently by Hadamard and de la Vallée Poussin in 1896 that the 
Riemann zeta function ¢(s) has no zeros on the line #(s) = 1 (see [20]). Only a 
few different proofs of this are now known, but the most intriguing is the proof, first 
discovered by Selberg (unpublished), that is based on the Eisenstein series E(z, s). 
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Recall the completed Eisenstein series 
E*(z, 8) := €"(2s)E(, s), 


with ¢*(s) = m~2T(s/2)C(s) = C*(1 — s), where E*(z,s) has the Fourier 
expansion 


ok * s * —s sot minx 
E*(z,s) = O*(2s)y' +." (2—2s)y'* +. 2/¥ Doras) [nl 2K,_ 1 rr |nlyerr™. 
n#0 


Now if €(1 + ito) = 0 for some real number fp this implies that 


E* (z, (d+ ito) /2) = a7), O~ity (n) |70|" K ity /2 (270 |n|y)er™™™ 
n#0 


which would be a (not identically zero) square integrable non-holomorphic cusp 
form for SL(2, Z). This is not possible! A simple way to see that it is not possible is 
to consider the inner product (E*( »*, (1 + 2ito)/2), E*( *,(1 + 2ito)/2)) which 
is a convergent integral (because of the rapid decay of the K-Bessel function) 
which must have a positive value > 0. This is a contradiction because by analytic 
continuation in the variable w it follows from the Rankin—Selberg unfolding 
method that 


1 lee) Ly, 
(e*( +. + io)/2), E*(4.m)) = ff Ete + imy/2)- Chm” S = 0 


which vanishes for all w € C because the constant term of E*(z, (1 + ito)/2) is 
zero. 

This method was later generalized by Jacquet and Shalika [12] where they prove 
that all automorphic L-functions of the group GL(n) do not vanish on the line 
t(s) = 1. The Eisenstein approach is very powerful. It is possible to obtain standard 
zero free regions of the form 


c 
a aaa EE Cy 
with explicit constants B,c > 0, for many families of L-functions by use of the 
Maass-Selberg relations following an approach developed by Sarnak [17] in 2004. 
In this regard see [3, 7, 8, 10]. All these recent results specifically require the theory 
of Arthur’s truncated Eisenstein series for higher rank groups. 

To complete this survey, we present a short sketch of Sarnak’s 2004 method in 
the setting of Arthur’s truncated Eisenstein series. The following theorem (proved 
in [17]) is equivalent to a standard zero free region for ¢(s). 
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Theorem 5.1 There exists an effectively computable constant C > 0 such that 


14 21 ——_—__ 
eee ad log (2 + |z\)° 


forallt eR. 


Proof of Theorem 5.1 Let T > 1,0 < 7 < 1 be fixed and t € R with |t| large. The 
main idea of the proof is to evaluate the following integral: 


T' (t,n) := |€(1 + 20)? [ [ |A7E(z, 1/241) = = ali 


in two different ways. 
First computation of Z’ (¢, 7) (Upper Bound): 
We will prove the upper bound 


T'(t,n) <« aoe 


- (log(2 + |t|)? + 2 log(T)). (4) 


To prove (4) note that 


T'(t,n) = |\C(1 + 2it)|? Il, N(z,n) |A7E(z, /2+i9)° aaa a 


where 


N(z,n) = #{y € Too\F | Im(yz) > n}. 


Since 0 < 7 < 1, we have the bound N(z, 7) « ; which may be combined with the 
Maass-Selberg relation 


$(1/2 + iT" + 61/2 + it) 
2it : 


I |ATE(, 1/2+ in| -_ ator) — 2 a/2+in 4 
T\b y ¢ 


This gives the upper bound (4) . 
Second computation of Z(t, 7) (Lower Bound): 
Assume 7 = T~!. We will prove the lower bound 


r tt 
aan OE a ©) 


It follows from Theorem 3.2 that if y > T~! then 


1 
} Foe 6 + ne"™ dx = 0. 
0 
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Recall that 


ATE(, 8) = EG, 5) — E'(z,8) -—O()E'@ 1-5). 


It follows that if 7 = T~! then the nonzero Fourier coefficients of E(z, s) coincide 
with the nonzero Fourier coefficients of A7 E(z, s) for y > n. Hence 


2 
oo lee) 
Ki(27y) | dy 
T'(t,n) >> anim f |e 


m=1 


1 
> ry Yo |o-2ie(m)/? 
ms il 


1 
>> 7 Yo lo-aielP)/? 


Il 
p< i 
PH 


Using sieve theory one may show that |o_2;(p)| >> 1 for a positive proportion of 
primes. We thus obtain the lower bound (5). There is a loss of one logarithm in the 
lower bound because (for large x) the number of primes < x is asymptotic to x/ log x 
by the prime number theorem. 


Theorem 5.1 immediately follows by comparing the lower bound (5) with the 


upper bound (4). Oo 
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On a Cubic Moment of Hardy’s Function 
with a Shift 


Aleksandar Ivié 


Abstract An asymptotic formula for 
T 
i Z(t)Z(t + U)dt (0 <U= U(T) < pi) 
T/2 


is derived, where 


-1/2 


Z(t) := €(5 + it)(x (5 + it)) (t€R), C(s) = x(s)E(1—s) 


is Hardy’s function. The cubic moment of Z(f) is also discussed, and a mean value 
result is presented which supports the author’s conjecture that 


T 
[ ood = or, 
1 


1 Introduction 


Let ¢(s) denote the Riemann zeta-function, and as usual let us define Hardy’s 
function Z(t) as 


1/2 


Z(t) := €(5 + it)(x (G + it)) (t€R), f(s) = x(s)E(1—s). 


We have (see, e.g., [5, Chap. 1] and [6]), with s = o + it a complex variable, 


oe an \ ott l/2 meet 1 
X(s) = 2'x™ sin (5308) P—s) = | — eer AO (1) 
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if t > to (> 0). In fact, one can obtain a full asymptotic expansion for y(s) from the 
classical Stirling formula for the gamma-function. 

The function Z(t) (see [8] for an extensive account) is real-valued, even, smooth, 
and as |Z(t)| = |¢ G + it)|, its zeros coincide with the zeros of €(s) on the “critical 
line” Res = 5. It is indispensable in the study of the zeros of €(s) on the critical line. 

The moments of Z(t) are of interest, but since |Z()| = |¢ G + it)|, only odd 
moments represent a novelty. Korolev [14] and Jutila [10, 11] showed indepen- 
dently that 


T T 
i Z(t) dt = O(T""4), / Z(t) dt = 24(T"), 
0 0 


thus establishing (up to the value of the numerical constants which are involved) 
the true order of the integral in question. This improves on the author’s bound 
O,(T'/4*®), obtained in [7]. 

However, higher odd moments of Z(t), where one also expects a lot of cancella- 
tions due to the oscillatory nature of the function Z(f), remain a mystery. 

In [8, Eq. (11.9)], an explicit formula for the cubic moment of Z(t) was presented. 
This is 


[20 dt = anf 2 > dx(n)n~6 cos (3xn3 ae a) ae Or), 
3 3/2 ene (L)3/? 
(a) <ns(Z) 
(2) 


where as usual d3(n) is the divisor function 


dx(n)= D1 (k.£,m.ne€N), 


k€m=n 


generated by €7(s) for Res > 1. 


Remark I Here and later ¢ denotes positive constants which are arbitrarily small, 
but are not necessarily the same ones at each occurrence, while a(x) <, b(x) (same 
as a(x) = O,(b(x))) means that the |a(x)| < Cb(x) for some C = C(e) > 0,x = Xp. 
By a X b we mean that a < b < a. The symbol f(x) = 24 (g(x) means that both 
lim sup f(x)/g(x) > 0 and lim inf, ‘f (x)/g(x) < 0 hold. 

X00 a 


The motivation for this investigation is the following problem, first posed by 
the author in Oberwolfach 2003 during the conference “Elementary and Analytic 
Number Theory”: Does there exist a constant c with 0 < c < 1 such that 


T 
/ Zt)dt = O(T*)? (3) 
1 
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This is Eq. (11.8) of [8]. So far obtaining (3) with any c < | is an open problem, but 
if one considers the cubic moment of |Z(f)|, then it is known that 


T T 
T(logT)* « / |Z(t)|° dt =| I¢(4+in)P dt « T(logT)", (4) 
1 1 


which establishes the true order of the integral in question. However, obtaining an 
asymptotic formula for this integral remains a difficult problem. The lower bound 
in (4) follows from general results of Ramachandra (see his monograph [15]), and 
the upper bound is a recent result of Bettin et al. [2]. 

A strong conjecture of the author is that 


T 
i Z(t)dt = O,(T?/***). (5) 
1 


Note that (5) would follow from (2) and the bound 


2 d;(nyerin Ke N23 +E. (6) 


N<n<N’<2N 


It may be remarked that the exponential sum in (6) is “pure” in the sense that 
the function in the exponential does not depend on any parameter as, for example, 
the sum 


= nit = a gitlogn el <N< ty, 


N<n<N’/<2N N<n<N’<2N 


which appears in the approximation to ¢ G + it) (see, e.g., Theorem 4.1 of [5]), 
depends on the parameter t. However, the difficulty in the estimation of the sum 
in (6) lies in the presence of the divisor function d3(n) which, in spite of its simple 
appearance, is quite difficult to deal with. 


Remark 2 A natural question is to ask: What is the minimal value of c for which (3) 
holds (c = 1 + é« is trivial)? In other words, to try to find an omega result for the 
integral in (3). A related and interesting problem is to investigate integrals of Z(t) 
with “shifts”, i.e., integrals where one (or more) factor Z(t) is replaced by Z(t + U). 
The parameter U, which does not depend on the variable of integration ¢, is supposed 
to be positive and o(T) as T — oo, where T is the order of the range of integration. 

Some results on such integrals already exist in the literature. For example, Hall 
[3] proved that, for U = a/logT,a < 1, we have uniformly 


sina /2 


T 
/ Z(t)Z(t + U) dt = T logT + 2Qy —1—2z)T cosa/2 
0 


aT 1/2 
+ O| —~+T'logT }. (7) 
log T 
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In (7) y = —I''(1) = 0.5772157... is Euler’s constant. Jutila [12] obtained 
recently an asymptotic formula for the left-hand side of (7) when 0 < U « T'/?. 
Bettin [1] evaluated asymptotically a related integral, namely 


T 
/ CU/2+U + itt ($—V—it) dt 
0 
under the condition that 
U=U(T)EC, V=V(T) EC, HeU « 1/logT, Ne V « 1/logT. 


Shimomura [16] dealt with the quartic moment 


T 
/ Z(t)Z2(t + U) dt, (8) 


0 


under certain conditions on the real parameter U, such that |U| + 1/logT — 0 as 
T — co. When U = 0, Shimomura’s expression for (8) reduces to 


T ‘i 
1 
y eg +a)tar= | Z(t) dt = =5T log’ T + O(T log’ T). (9) 
0 0 uy 


The (weak) asymptotic formula (9) is a classical result of Ingham [4] of 1928. 
For the sharpest known asymptotic formula for the integral in (9), see the paper 
of Ivié—Motohashi [9]. 


2 Statement of Results 


The first aim of this work is to investigate another integral with the shift of Z(t), 
namely the cubic moment 


T 
/ Z(t)Z(t + U) dt (10) 
T/2 


for a certain range of the “shift parameter’ U = U(T) > 0. We shall prove the 
following 


Theorem 1 For0 < U= U(T)< T'/2-€ we have, uniformly in U, 


T 
/ Z(t)Z(t + U) dt = O,(T?/4**) 
T/2 


9 
+2n ie » h(n, U)n'/6+9? exp(—3xin?/? — i/8)}1 + K(n, un. 
T,<n<To 


(11) 
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Here (d(n) is the number of divisors of n) 


ey 
= (12) 
Vv 8103 8x3 


h(n, U) = nS" d(8)8", Ty = 
5|n 


K(n, U) := adU?n 27 +--+ dg Un 73 +. OCU 9 2 DP) (13) 


for any given integer k = 2, with effectively computable constants dz, d3,... . 


Remark 3 When U — 0-+, the expressions in (11)-(13) tend to (2), since the 
integral is real and a h(n, U) = d3(n). Thus Theorem 1 is a generalization 


of (2) for a relatively wide range of U. 


Remark 4 By methods similar to those used in proving Theorem | one can deal 
also with the “conjugate” integral 


Tr 
i FC4+UZ0da W=<0eT?™), 
T/2 

Remark 5 Analogously to (5) one may conjecture that, uniformly for0 < U < 


T1/2-€ 


T 
/ Z(t)Z(t + U) dt = O,(T?/***). 
0 


The second aim of this work is to demonstrate, although (5) and (6) seem at 
present intractable, that it is possible to show that the conjecture (6) holds in a certain 
mean value sense. To this end let 


S(a,N):= > ds(nye""” — (wR), (14) 


N<n<N’S<2N 
so that the exponential sum in (6) is $(3z, N). Then we have 


Theorem 2 Every finite interval [A, B] (0 <A < B) contains at least one point C 
such that 


S(C,N) «, N?? log?/? N. (15) 


3 Proof of the Theorems 


We begin with the proof of Theorem 1. Suppose that 0 < U < T!/?~* and let 


z 1/2+iT 
aa [Zoz+ U) dt == / CP(s)E(s + U(7(s)x(s + iV) ds. 
T/2 1/2+iT/2 


(16) 


104 A. Ivié 


The procedure of writing a real-valued integral like a complex integral is fairly 
standard in analytic number theory. For example, see the proof of Theorem 7.4 in 
Titchmarsh’s monograph [17] on ¢(s) and Jutila’s work [12]. It allows one flexibility 
by suitably deforming the contour of integration in the complex plane. Incidentally, 
this method of proof is different from the proof of (2) in [5], which is based on the 
use of approximate functional equations. 

Let 


2 
h(n, U) := nS" d(5)8", No = NAT, U) := (——. (17) 


5|n 
so that h(n, U) <, n°. We have 
T= +h, 


say, where 


1/2+iT 
hss j Y h(n, UP) xls + UD)” ds, 


U J1/2+iT/2 n<No 


1 pi/2+ir ; _ ; seccetila 
=. / 1p VEILS HIV) — YO hl OP DxUs + HWY) ads 


i 


1/24iT nNo 
(18) 
and we shall keep in mind that for Stes > 1, since d(n) is generated by £7(s), 
Co le) Co 
h(n, Un = $7 Y° dK (kim) (kim) = 67 (s)E(s + iU). (19) 
n=1 k=1 m=1 


To be able to exploit (19) let henceforth c = 1 + e. Then, by Cauchy’s theorem, 


1 ctiT/2 1/2+iT ctiT 1 
h=-7 / +f +f = —-V+44+ 43), 
U 1/2+iT/2 ctiT ctiT/2 l 


say. For T/2 <t < T, $<o0Sc,s=ot+it,0<U«K T'/2-* we have 
see ( 2 \ 20 t2it-1 on o+ittiU-y 1 
2 : am 3it+iU 
+iU)=e4 {| — 1+ 0{-)}, 
Ponc+in=e (SF) (ZS) eeti+o(7)| 


(20) 
so that uniformly in U 


Foyer) = Pre. C=7T 


On a Cubic Moment of Hardy’s Function with a Shift 105 


This follows from the asymptotic formula (1), and actually the term O(1/t) can be 
replaced by an asymptotic expansion. We also have the convexity bound (see, e.g., 
Chap. 1 of [5]) 


[j<akrer" Cesc). (21) 


Stronger bounds than (21) are available, but this bound is sufficient for our present 
purposes. 
By using (20), (21) and the trivial bound 


> h(n, U)n- Ke aor ear T3-30)/2+e (i <ox< c) ; 


n<No 


it follows readily that 


G 
Ji tJo Xz / POPS ee (22) 
1/2 


In J3 we have s = c + it, T/2 < t < T. Hence 


T 
J3 =i 2 A(n, un f 
T/2 


n Woe 3x i/8 ( 
n>No / 


ct+it—1/2 
=) 


(2c-+2it-+2iU—1)/4 
a hag U eGittiU)/2 Gy 
20 


+0 es |A(n, unpre) 


n>No 


The expression in the O-term is clearly <, T?/4*®, and it remains to estimate 


ir 
> h(n, U)ne 1/2 i K(te© dt, (23) 
T/2 


n>No 


with 


t \c-l/2 fp 4 U\ &-D/4 

K@):= (—) = K_ Tatté 
20 20 

forc = 1 + «. In (23) we have 


t+U 
Qn 


t 
20 


Sr(t) = tlog + 5(t+ U) log = 3t- tlogn, 
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t t+U 
"(t) = log — 
In) = log 5 on 


1 1 3t+2U 1 
+ = oe (24) 
t 2t+vU) 2tt+U) T 


+ 5 log — logn, 


in) = 


In the sum in (23) set n = [No] + v, v € N. By the second derivative test (Lemma 
2.2 of [5]) the contribution of the term v = 1 is < T®~!/4. The terms v > 2 are 
estimated by the use of the first derivative test (Lemma 2.1 of [5]). For v >> No it is 
found that [f/()| >> 1, while for v < No one has, for a suitable c > 0, 


N 
Vito] = tog MEY 5 tog (1 + ev79") > vr? 
In WY 20 


Thus the contribution of the terms with v > 2 will be 


Ke PUT YT + DEY IAIN + v, Ube ® max el 
2<vKNy VN T/2<t<T 


K- T3/4te Noy re! us PS pe/2-l-e 
25v<KNo v>No 


Ks piste 


since No x T?/2. 
It remains to deal with /;. We have 


T 
i _ | > h(n, U)n 2 9@1U/? gin dt + ONT) 
T/2 


n<No 
= pW /24-3ni/8 y- h(n, U)n"/?_g,,(T) + O.(T7/4**), (25) 
n<No 
say, where 
T . 
JAT) = / ell at, (26) 
T/2 


The exponential integral in (26) has a saddle point f,, the root of the equation f/(t) = 
0, which occurs for 


P(t, + U) = 827 n’. (27) 


On a Cubic Moment of Hardy’s Function with a Shift 107 


It is immediately seen that t, =< n/3 x T, and that Eq. (27) is cubic in ¢, for a 
given n and U, and thus it is not easily solvable. However, its solution can be found 
asymptotically by an iterative process. First from (27) we have 


ft = 8x7n? + O(UT), 
which yields 
tn = 2nn?/? + O(U). (28) 


As we require T/2 < t, < T, from (27) we infer that N; < n < No, where No is 
defined by (17) (this is the reason for the choice of No) and 


a2 | (T/2)°(T/2 + U) 


8703 


Setting n = [N,] — 4,1 < ww < MN, and proceeding similarly as in the estimation of 
the sum with v > 2 after (24), we obtain that the contribution of n < N; to J; in (25) 
will be O,(T?/4+*). Since Y%,(T) « T'/? [see (31) and (32)] it transpires that the 
condition N; < n < No may be replaced by T; < n < To, where To, T, are defined 
by (12). In this process the total error term will be O.(T*/4**). Therefore, for the 
range 0 < U < T'/?~*, we have uniformly 


= e U/2 937i /8 > h(n, U)n-'/?_ 9,(T) +4 OFT), (29) 


T| <n<To 


and the range of summation in (29) is easier to deal with than the range of 
summation N; < n < Np. At this point we shall evaluate %,(T) by one of the 
theorems related to exponential integrals with a saddle point, e.g., by the one on p. 
71 of the monograph by Karatsuba—Voronin [13]. This says that, if f(x) ¢ C[a, b], 
b 21 if (c 
/ et) dy arin OT + O(AV') 
a f"(c) 


+ 0( min(lf'@I"', /a)) + O(min(f")". Vb), Gd) 


where 
0<b-aK<V,f'(c)=0,a<cK<b, 
OSA. FO <A I OSA VO, 
We shall apply (30) with 


f(x) = 


fr(x) 
= 


1 
ae C= a= sf, 8=7, Vif @S = 
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so that A = T,f® (x) « T~?, f(x) « T~3, which is needed. The term O(AV~') 
in (30) will make a contribution which is O,(T?/4**), and the same assertion will 
hold for the other two error terms in (30). This will lead to 


iy pte 
i _ oe U/2 637i /8 » h(n, U)n ert —_ + OAT), (31) 
T| <nX<To iw) 


Since, by (24) and (27), 


a th tn U 
en = exp \i(t log 5 + 5(t + U) log we) — ty — tn logn)} 


L(t, +U ty + U 

= exp { itn log nn +) =F 5Uilog We) - ith — itn logn} 
(m+U) 3. 

= exp {} iU log ———— i Sit} 


iU/2 
= mn (uy 


it follows that 


if =0,(T*/4**) + D7 @ 1/2 97/8 


a AO). op ee 
Y a.) SS er) . 32 
ee aU) at, ou On (32) 
1<nX<Tp 


In view of (27), (28) andn x T?/? we have 


pole 2ty(t, + U) cae 2(27)3 
3t, + 2U ty(3t, + 2U) 


= J2Qn)3n?4! (3-1 3 o(u/T)) 


=¥3 (20)3n 2((8n'n) 1/3 4 olu)) + O(UT75/*) 
= 2fa nu + O(UT~*'4), 


It remains to evaluate 


t, + U\U? 
ote) ; (33) 


A := A(n,U) = | 
20 
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To achieve this we need a more precise expression for #, than (28). Putting (28) 
in (27) we first have 


f= 823n* — ie = 827? — 407 Un*? + OU"), 
th = (82° n? — 4x2Un*/3)' (1 + O(U*n *?)) 
= 2a? —1U + O(n). 
This yields the second approximation 
ty = 2nn?!? — 5U + O(U?n-?!?), (34) 
Inserting (34) in (27) we shall obtain next 


U2n—2/3 


n= 2a? — ly Ua + o(usn-*/). (35) 


In fact, (35) is just the beginning of an asymptotic expansion for ¢, in which each 
term is by a factor of Un~?/? of a lower order of magnitude than the preceding one. 
This can be established by mathematical induction. Then we see that (c’s and d’s 
are effectively computable constants) 


2-2/3 


ae Unt 4p O(U4n-°/3) 
127 


3 U 
logo = if — $U- Qn) + bu + 


U U2n-2/3 
1 2/3 

Ly tog (12 + 2 

gee ae Ber 


= —3rin?? + 2iUlogn + icgU?n!? + icsU?n4? + O(U4*n-"), 


and therefore 
A= ni e3ain? {1 +4 idbU?n 2/3 a id,U?n43 +4 O(U‘n*!*)} . (36) 


and we remark that the last term in curly brackets admits an asymptotic expansion. 
Since U « T'/?-* by assumption, U?n~?/? «< T~*° and thus each term in the 
expansion will be by a factor of at least T~~* smaller than the preceding one. One 
sees now why the interval of integration in (10) and (11) is [7/2, T] and not the 
more natural [0, T], because the terms djUin7i/ 3 become large if n is small, and 
K(n, U) is then also large. 

Inserting (36) (as a full asymptotic expansion) and (33) in (32), we obtain the 
assertion of Theorem |, having in mind that we have shown that 


lb| + |b] «. TI". 
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To prove Theorem 2 note that, with S(@, N) defined by (14), we have 


B 
B 
: |S(a,N)|? do = / 2d, A(n) + S> dslm)ds(nye""—"™) | dor 
. A ménXN 
d3(m)d. 
= (B—A))\d3(n) + O 2 ra ; (37) 


nxN mx#nxXN 


Note that the function d3(n) is multiplicative and d3(p) = 3. Thus for Nies > 1 
and p a generic prime we have 


Seen = I] (1 As d3(p)p* +4 B(p?)p-> + +) 


n=1 Pp 


= £°(s) [ [a -p*y TI (1 +4 9p 4 d(p?)p > fie -) 
P P 


= 0°(s)F(s), (38) 


where F(s) is a Dirichlet series that is absolutely convergent for %es > 1/2. Thus 
the series in (38) is dominated by £°(s) and it follows, by a simple convolution 
argument, that 


S- dz(n) = C)xlog’ x + O(xlog’ x) (39) 


nx 


for a constant C, > 0. 
Next, by using the trivial inequality 


lab] < 3(@ +b*) 


and (39), it is seen that the double sum in (37) in the O-term is, by symmetry, 


< NIB > d(m) - d;(n) 


|m — n| 
m#nXN 
= 2N"? S° dz(m) De ical 
mxN n#m, nr 


« N47 log? N, 


since, for a fixed m such that m = N, 


1 
> jn—m| < log N. 
n—-m 


n#mnxN 
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From this estimate and (39) it follows that 
B 
/ IS(a,N)|? da « N* log? N, (40) 
A 


which implies the assertion of Theorem 2. A further discussion on the cubic moment 
of Z(t) is to be found in Chap. 11 of the author’s monograph [8]. 


Remark 6 An interesting problem is to determine the true order of the integral 
in (40). 


Remark 7 If the divisor function d3(n) is not present in the definition (14), namely 
if one considers the sum 


T(a, N) := ba erin? (a #0), 


N<n<N’<2N 


then the problem is much easier. By using Lemma 1.2 and Lemma 2.1 of [5] it easily 
follows that 


T(a,N) <|a|1N'?. 
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Some Analogues of Pair Correlation of Zeta 
Zeros 


Yunus Karabulut and Cem Yal¢in Yildirim 


Abstract By modifying Montgomery’s calculation of the pair correlation of zeta 
zeros, we derive analogous results. In this article we work out the correlation of 
zeta zeros with the relative maxima of the zeta-function on the critical line, the 
pair correlation of these maxima and the correlation of zeros of two Dirichlet 
L-functions. In each case the relevant Riemann Hypothesis is assumed for obtaining 
the results. Several auxiliary results necessary for the calculations may be useful in 
problems about the zeta-function. 


1 Introduction 


The Riemann zeta-function ¢(s), for the complex variable s = o + it with real 
numbers o and ¢, has all of its so-called non-trivial zeros in the strip o € (0, 1), and 
the number of these zeros with ordinates in (0, T] is 

T 


T T 
Ne(T) = log — — — + O(logT). 1 

(1) = 5— log — ~ = + O(log T) (1) 
The Riemann Hypothesis (RH) is the statement that all of these zeros are on the 
critical line o = 5 In [17] Montgomery, starting from an explicit formula (given 
in (18) below) in Landau’s Handbuch [16] and assuming RH, elicited and defined 


the expression 


4 
4+(y-y) 


Fyg@,T):= x0"), (>), (2) 


O0<y.y<T 


calculated its values, and broke new ground by the results, conjectures and observa- 
tions concerning the distribution of zeta zeros on the critical line and relations with 
the distribution of primes which were deduced therefrom. Here y and y each denote 
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the ordinate of a non-trivial zero of ¢(s), and am is a suitable weight function. 


From the definition (2) it is clear that Fg¢(x,T) = Fez (4, T) (so that one can 
consider only x > 1), and that 


co 


2 
Fee(x, T)= =| 
—0o 


Montgomery’s calculation gave 


T log’ T 
Fr e(x,T) = Ine 1+0 


dt > 0. (3) 


xiv 
dL isey| 


O0<y<T 


T 
)) + — logx + o(T logT) + O(@logx), 


20 
(4) 
as T — oo. In [17] Montgomery expressed his results in a different form which is 
related to (2) by setting 


log T 


x=", T>2, aER (5) 
and 
[ meh hie 
F(@) = F(@,T) := (= log r) > THO—Pw(y — 7), (6) 
20 - 
0<y,y<T 
so that the result read 
F(a) = (1 + o(1))T~™* log T + @ + o(1), (7) 


as T — oo, uniformly for 0 < a < 1, along with F(—a) = F(a). (With 
further effort involving the use of a sieve upper bound for prime pairs, Goldston 
[8] improved the error term O(x log x) in (4) to O(x), as had been pointed out by 
Montgomery, leading to the uniformity up to aw = 1; see also [12]). 

By convolving F(a) with appropriate kernels, Montgomery deduced that asymp- 
totically at least z of zeta zeros are simple, and the existence of infinitely many 
gaps between consecutive zeta zeros of size at most 0.68 times the average gap. 
This has since been reduced to 0.51... by several researchers; achieving an upper 
bound below 5 will have important consequences in number theory (see [15]). By 
drawing upon conjectural estimates for twin primes, and also upon the g-analogue 
of an expression which comes up in his calculation, Montgomery conjectured that 


F(a) = 1+ 0(1) (8) 
uniformly for @ > 1 in bounded intervals. Trivially F(a) < F(0) for any a, and by 


Dirichlet’s theorem on Diophantine approximation, there are large @ = a(T) such 
that F(a) becomes arbitrarily close to F(0) ~ log T (which follows unconditionally 
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from (1)), so the restriction to bounded intervals is essential for (8). We note that 
such large values of F(q@) can occur over intervals of length at most Olmert): This 
follows from the upper bound in Goldston’s (see [9]) estimates that for any « > 0 
and sufficiently large T 


c+l 2 c+l 8 
/ Fla) da > 5 —<«, F@)dw= 7+, (9) 


—1 


uniformly for any real number c (which may be any function of 7). From the 
conjecture (8), Montgomery deduced the so-called pair correlation conjecture 


B sin tu \ T 
» low [o-( ) du + 6a.) —logT, (10) 
” TU 20 


O<y,yST 
~_ 2. 
<y-7s 5 


20a 
logT 


as T — oo. Here 6(a, 8) = 1 if 0 € [a, B], 6(a@, B) = O otherwise. As for the 
até 


converse direction, Goldston [10] showed that (10) implies / F(B,T)dp ~ 6 


for any fixed numbers a > 1 and 6 > 0, ie. that (8) holds on average. From 
either (8) or (10), it follows that almost all zeta zeros are simple. EJ. Dyson 
remarked to Montgomery that the integrand in (10) occurs as the pair correlation 
function of the eigenvalues of random complex Hermitian matrices in the Gaussian 
Unitary Ensemble, and Montgomery interpreted this being in accordance with the 
view (which legend dates back to Hilbert and Polya, see [20]) that there is a 
yet undiscovered linear operator whose eigenvalues characterize the zeros of the 
Riemann zeta-function. Montgomery’s work [17] has been very influential for the 
developments in the theory of the Riemann zeta function (and other zeta functions 
as well) and formed the basis of the connections with random matrix theory. For a 
comprehensive survey on this topic and relations with the distribution of primes we 
refer the reader to Goldston’s article [11]. 


Brief Statement of Results In this article we modify Montgomery’s argument in 
such a way that we can obtain some analogues of the pair correlation of zeta 
zeros. We begin with a brief sketch of Montgomery’s method. Then in Sect.3 we 
present an alternative method for correlating zeta zeros to any sequence of real 
numbers. The first application, in Sect.4, is to Montgomery’s original problem. 
(The contents of Sects.3 and 4 were already announced in [24]). Upon some 
preliminaries concerning x(s) (defined in (12)) and the relative maxima of ¢(s) 
on the critical line in Sect.5 and concerning the iterated convolutions of the von 
Mangoldt function in Sect.6, the method of Sect.3 is applied in Sect. 7 to the 
correlation of zeta zeros with the maxima of zeta on the critical line. Corresponding 
to F(a) ~ a fore < a < | —€ from (7), we find in Theorem | in Sect. 7 that 
Fez, (a) ~ a — 2a7, (« < a < 1 —€). Here Z;(s), defined in Sect. 5, is a function 
which vanishes at the relative maxima of ¢(s) on the critical line. For this purpose 
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a Landau—Gonek type formula involving the zeros of Z;(s) is developed in Sect. 7. 
In Theorem 2 in Sect. 8 we obtain the values of the pair correlation of the maxima 
Fz,,z,(a@) for 0 < a < 1 —e, to be followed by values of F¢.z, ¢.z,(@), and then 
deduce some results on small gaps. In Sect. 9 we adapt our alternative argument to 
obtain in Theorem 3 the correlation between the zeros of two Dirichlet L-functions. 
Finally in Sect. 10 we mention some possible ways to continue this research. 


Some Basic Formulae Before embarking on the calculations we recall that the 
Riemann zeta-function satisfies the functional equation 


G(s) = x(s)oU —), (11) 
where 
eiGe 
110) = 20m)" — 9) sin) = (12) 
Logarithmic differentiation of the functional equation gives 
: (9) = (9) - : (1-3), (13) 


From (12) and some basic properties of the /"-function, we see that 


/ r’ r 
(s) =— r (+5 tan = +log2z = —= (1-8) +5 cot + log 2r. (14) 
Since 
rT’ 1 1 
—(s) = logs—-—+O|{—], (\s|=>6>0, |args| < 7 —5), (15) 
rT 2s |s|? 
where 6 > 0 is arbitrarily small but fixed, we have 


i 1 
76) = ~log St +0 (;) , (t| = 1, say; 0) < 0 <0»), (16) 


where 0; < 02 are any fixed real numbers. Under the same conditions one also has 


(<) So «pr, =v. (17) 
ds} x 


Some further facts concerning €(s) and y(s) will be given in Sects. 5 and 6. 
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Notation Throughout this paper, 7 will denote a large number tending to oo, € will 
be a fixed positive number which can be taken arbitrarily small and which may not 
have the same value each time it appears, and letters with indices such as C; will 
denote sufficiently large positive constants of fixed value. We write N = Z* U {0}. 
We will employ the Iverson notation that for a statement S, [S] = 1 if S is true, and 
[S] = 0 if S is false. 


2 A Sketch of Montgomery’s Derivation 


The explicit formula from Landau’s book [16] is that, if x > 1 and x 4 p* (p prime, 
and k a positive integer), then 


= — CO 2p 
a l—s x8 yrs 


tLigge 8 - 


=1 


provided s # 1, s # p, s # —2r. Here pe runs through the non-trivial zeros of (s), 


Xe 


and the series over p is convergent subject to the interpretation lim 
U—oo p— 
|S p|<U 


Upon assuming RH, Montgomery deduced that 


xiv i y\ boo tit x\ ott 
(Qo -1)>° ia (Sa0(2) + 40(*) 


7 @=3) +¢= 


nSx n>x 
1 
a4 \ t-o+it (Qo = 1)x2 
aR RG TT RT 
gests (20 — 1)x72" 
= 1 
2 BS it?) 


r=1 


valid for o > 1, and all x => 1. In this formula Montgomery used (13)-(16) to 
replace F(1 —o +it) by -£(6 — it) —log(|t|+2)+ O() = —log(|t| +2)+0,() 
(for s in a fixed strip in o > 1), and he put upper-bound estimates for the last two 
terms of (19). Montgomery tooka = 3, squared the modulus of both sides, and then 
integrated both sides over t from 0 to T. From the expression thus arising from the 
left-hand side of (19), he discarded those y ¢ [0, 7] within an error of O(log? T), and 
in order to evaluate the integral he extended the range of integration to i within 
an even smaller error. In this way the left-hand side of (19) led to the expression (2). 
To carry out the integration of the square of the series involving A(n) coming from 
the right-hand side of (19), Montgomery resorted to Parseval’s formula for Dirichlet 
series from [18]: 


lo) T 2 
If y n\a,|* converges, then i > a,n" 
n 


n=1 0 


dt = > |an|?(T + O(n)), 


The result of this calculation was (4) (or (7)). 
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3 A Modified Approach 


With o = 3, (19) reads 
Axi i , —3+it < S+it 
a ea cerae =-x '(Dam(2) +d am(*) 


Y 
: 3 hay 4x? 
-5(- ; +it\x OTHE a. ; 


oo —2r 


1 xX 
—4x72 
* pe ear emer 


r=1 


Using 
@ i te J 
ee Os Wee j=p0=e20 


L{/I'({3-o+it F o —it 
(FS) +F(4S4)) oo 


which follows from the formulas in Chap. 12 of [2], witho = 3, we have 


Ayilv—1) : i 
a oe = —x? ~ A(n)n2—* ao y A(n)n7 27 
Y —_— 


y nsx n>Xx 


w( (5 — i) — log *) 
SEH FE-9) 


4} = . (2) 
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Using (15), and doing elementary estimations, we can simplify (21) into 


Ayi(v—)) 2 3_it 2 —2~it 
> ——, = -x > An? —Xx Yo A(n)n ? 
pee =o 


nsx n>Xx 


+x *(log(|¢| + 3) + O(1)) 


xo x3 
~~ ——_— 1). 22 
+ (avin) + (ita) (x21). (22) 


When ¢ is let to run through a set we sum both sides of (22) over this set of values 
of t. This will be feasible if one can calculate the sums over f, in particular 2 foal 


t 
where p is a prime and a is a natural number. 


4 Pair Correlation of Zeta Zeros 


We apply our method first to the quantity (2) considered by Montgomery. So, letting 
t run through those ordinates y of the zeros of the Riemann zeta-function which are 
in the interval (0, 7], from (22) we have 


dyiv— y) - 3 if A(n) —i 
Le Lee 


Y O<y<T nsx 0<y<T n>x 0<y<T 
as 1 1 
+x? SO (087 +000) + 0{! > 7) 
0<y<T 0<y<T 


oe y ;) (23) 


0<y<T y 


for x > 1. From the count of zeta zeros with ordinates in (0, 7] given in (1), we 
easily estimate the last terms of (23) and re-state (23) as 


Axil?) , ‘ An) 
=-x_ A 2 ~ x2 
a 
Y O<ysT nNSx 0<y<T n>Xx 0<y<T 
felled (ee ae + O(x2) (24) 
Qnx? log T as 


In the sum on the left-hand side we can exclude those y ¢ [0, 7] within an error of 
O(log? T), by a standard calculation based upon (1). 
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We now have recourse to Gonek’s [13] uniform version of Landau’s formula 


T 
x yea 5 AM) + O(y log 2yT log log 3y) (25) 


O0<y<T 
; y . 1 
+ O| log ymin { T, — ] }] + O{ log 27 min | T, —— 
(y) logy 


for y, T > 1, where p denotes a complex zero of ¢(s) and (y) denotes the distance 
from y to the nearest prime power other than y itself. This formula is unconditional. 
Assuming RH and using (25) for the inner sum occurring in the first two terms of 
the right-hand side of (24), the contribution from the first term of the right-hand side 
of (25) is 


‘a ae A(n)* 
a 2S nA t+?) > 3 


nsx n>Xx 


T 2 - 3 
ae [e (; log x 5 + Ox? log? 2») 


rt 
ye (SE + pe toud 2 log? 2»)| 


9x2 4x2 
T ..t 3 
= — logx+ O(Tx 2 log? 2x). (26) 
20 
Here we have calculated the sums from 
S~ A(n) = x + O(e? log? 2x) (27) 


n<x 


which is an expression of the prime number theorem under RH. The first error term 
in (25) contributes 


a ay n* A(n) log 2nT log log 3n + x7 yee ee 2nT log log 3n 


nsx n>Xx 


< x log 2xT log log 3x. (28) 


The second error term in (25) contributes 


1 A 
< 4D nA(a) lognmin( Tn) +2? > A tognmin(7 1) 


n<x n>x 
x2 : 
Xxlog2x + 7 logT - x<T, (29) 
T logx if x>T. 


Finally the last error term of (25) contributes 


ae Pe is 2 A(n) log 2T log T 
=) na : 30 
< => “ ee oe nm logn > log 2x ee) 


nSx n>x 
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Combining the above we find that 


Fe+(x,T) Eee tie (—_\ ye 4 G1) 
x,T) = — logx 
es 2x? log T In 
T log? 2. 
+O(xlog(xT) log log 3x) + O (te) 
x2 
as T — oo. This gives an asymptotic result for 1 < x = (a). 


5 Preliminaries Concerning the Relative Maxima 
of |5(4 + i0)| 


We shall be concerned with the function 
' Lz’ 
Zi (s) = O(s) — 2 8) (32) 


which was introduced by Conrey and Ghosh [1]. We easily see the relation 


1 
Z (5 + i) | = |Z'(0| (33) 
with Hardy’s Z-function 


Z(t) = (x = ie t ery (34) 
a 2 


which is real for real f and satisfies |Z(t)| = |(5 + it)|. Taking derivatives of both 


sides of (34), and observing that LG + it) is a real number, we see that 


Zayagt (; : ) | (35) 


Taking imaginary parts in the partial fraction formula (from Chap. 12 of [2]) 


— Ve i tr ps i .i 
Greg eee or oe — 


(here y, denotes Euler’s constant), we have 


oe yt \ @ Tet Wwe 1 
me ae > a oe 4.2 eer (37) 
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Now we assume RH, and use (15), to obtain 


As a (lo \ fl a 
50 =98(5 i) = 3 o(7)+ Dao (38) 


y>0 


This formula reveals that as 5 + it crawls up on the critical line from a zero of ¢(s) 
to the next zero, there will be only one point where Z’(t) = 0. Hence we obtain that 
on RH, the zeros of Z’(t) are interlaced with the zeros of Z(t) (the case of possible 
multiple zeros of Z(t) can also be included in this statement), and |C( 5 + it)| has 
exactly one maximum between consecutive zeta zeros in the upper half-plane (with 
the symmetrical configuration in the lower half-plane). 

We shall need to know the zeros and poles of Z;(s), but before that let us consider 
the same for y(s) and its derivative. From (12), since the J’-function never vanishes, 
it is clear that the zeros of x(s) can only be those zeros of sin(4*) which are not 
cancelled by the poles of (1 — s), ie. 


{s; x(s) = 0} = {0, —2, -4,...}. (39) 

Similarly, we see 
{s; sisapole of y(s)} = {1,3,5,...}, (40) 
and that all zeros and poles of y(s) are simple. By Rolle’s theorem there are the real 


zeros of x'(s) between the zeros of y(s). A little computational examining using (14) 
reveals that 


1 
{s ER; y’(s) = 0} = {ea,1—Ke3£ eZ}, K€ (-}-9) , Kz € (—3, —2), 


K3 € (—5,—4), Ke € (—20, -—20 + 1) for £ > 4, 


Ke > —2L as loom. (41) 


These zeros are all simple and the symmetry with respect to the point 5 is due to 
x x 
—(s) = —(—s) (42) 
x x 


which is obvious from the functional equation of ¢(s). To look for non-real zeros of 
x’ (s), we use 


rt 1 ae | 1 
eee Carer (-s ¢N) (43) 
in 
oe 1T' s 1r’ (1-s 
© (9 =t0g4 - 57 (5)- 55 ( 2 oo 
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to see that 
x a ( 1 1 
SA(+n= - (45) 
x a (52+n) +) G++ (5) 


except at the poles. Thus the zeros of ’(s) can occur either when t = 0 (and these 
have already been identified), or when o = 5. It is customary to write 


1 ; 
- (5 + i) =e 70, (8(1) ER), (46) 
where 
~ 1 it t _t t t 1 1 7 
eid € - 5) ae 5 18 (57) 2-81 4° S708 | 
(47) 
(see Sect. 6.5 of [4]). From (46), we see 
/ 
us (5 + i) = —20'(t) ER, (48) 
x\2 


so that, by (43) and (44) we have 


oo 1 
20/(t) = n © (5 +i) - (loge +ye)-7- = + : ; a 
Xx 2 a + 2 n G + n)2 a (5)? 


n=1 


(49) 
This implies 
00 1 t 
tn) 
w=) (Gj 7 EB a7 > 0 (> 0), (50) 
n=0 ((} +n)" + (5)’) 
i.e. 0’ (2) is strictly increasing for t > 0. We find from (14) that 
"(1 
29/0) = —£ (5) = -(v. jigua a. 2p log 2) we —5.37..., (51) 
x\2 2 
and we know from Sect. 6.5 of [4] that 
w(t) >0, (t= 10). (52) 
We deduce that 
1 
¥(s)) =0,s¢gR => sa 5 + it, foraunique t € (0, 10) (53) 


(t must be near 277, since from (47) we see 20’(t) = log(s>) + O(4)). 
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Hall’s paper [14] contains thorough information about the zeros and poles of 
Z,(s) (Hall denotes our Z;(s) by F(s)). We see that 


{s; s isapole of Z;(s)} = {0, 1, 3,5,7,...} (54) 


(the pole at s = 1 is a double pole coming from the simple poles of ¢(s) and of 
x(s) at s = 1, the simple pole at s = 0 arises from y(0) = 0, and the simple poles 
at s = 3,5,7,... come from the poles of y(s) at these points). Hall, assuming RH, 
showed that all non-real (which will be termed as non-trivial) zeros of Z,(s) are 
on the critical line, so that, assuming RH, the non-real points where Z)(s) vanishes 
are the relative maxima of | G + it)|, and the multiple zeta zeros if ever these 
exist. Some information about these have already been given in the beginning of 
this section. Hall also proved assuming RH that the number of the zeros of Z| (s) 
(counted according to multiplicity) with ordinates in (0, T], Nz, (T), is 


y 


1 Z 3 
Nz (T) = Ne(T) — 7s) — 2 (55) 


provided that T is not the ordinate of a zero of €(s) or of Z)(s). There are real zeros 
of Z;(s), all of them simple zeros. These may be termed as the trivial zeros of Z(s). 
The trivial zeros are placed symmetrically with respect to the point 5 because of the 
functional equation 


Z\(s) = —x(s)Zi0 — 5), (56) 
which follows from (32) and (11). We have 


{s; s is a trivial zero of Z,(s)} = {zo,ze,1—z3£ € Z*}, 


1 
2 = 521 € 3,5). € (5,7), € QE + 2,26 +3) 


for € > 3. (57) 


Since Z,(s) doesn’t have an associated Dirichlet series, we will need a Dirichlet 
Ul 


polynomial which approximates 7) ino > 1, |t| => t. From (32) we have 
1 


(58) 


Here we use the estimates (16), (17), and 


G) . 
=) &liesisei, G Stee =i, S39, (59) 
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which depends on RH and follows from Corollary 13.14 of [19] by a standard 
application of Cauchy’s estimate, so that we can write 


Lig Ore ie 6) ( 1 ) (60) 


Zi + (5) |t| log |¢| 


Expanding the denominator as geometric series, we see that 


log |r| 


1 Licging iT ne 2 t 
a a (-; ato) i 


TO k=0 


C, log |t| log log log |t 
+0 |qrtexp 1 log |z| log log log |¢| . 
log log |¢| 


Now, for k € N ando > 1, we write 


/ k SAN (n 
(-S0) => 4% (62) 
n=1 
/ " (k) n 
(fw) =) =: oe ed (63) 
C 4 n=1 fe 


Using (61)—(63) in (60) gives the following Dirichlet polynomial approximation (the 
series begins with m = 2 since A**T)(1) = A*®(1) = 0). 


Proposition 1 Assume RH. In the region 1 < 0 < 09, |t| = to, we have 


Leet ni k 
1 2 
Zi =) a (k) (k+1) 


k=0 


C, log |t| log log log |t 
+O Fe exp 1 ey | 0g 108 e | : (64) 
log log |¢| 


Remark We were led to the approximation (64) because the idea of replacing —< X(s) 
by log 4 3, in (32) and then developing an approximating Dirichlet series, as was igh 
in [1], ends up giving a weaker (valid only for x up to T3-€) result than Theorem | 
in Sect. 7. Unconditionally, we have £(s) <<; (log t)3 (log log Ir|)3, (o > 1, |t| = 
to) at our disposal (see [21], Sect. 6.19), causing the error term in (64) to become 


O(|t|~3 +9), and this in turn also leads to a smaller range in Theorem 1. Anyhow, 
results of the type we are pursuing make more sense when RH is assumed. 
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6 Preliminaries Concerning the Iterated Convolutions of the 
von Mangoldt Function 


We shall need in Sects. 7 and 8 some properties of the arithmetic functions A“ (n) 
and A“ (n). We note that the article [5] by Farmer et al. contains some related results. 
First, proceeding inductively and denoting Dirichlet convolution by *, we see the 
trivial bound, for k > 1, 


A®(n) = AY « AG) = Yo AMAKY (<) < (log ny! ¥* Ad) = (logn)*. 


d\n d\n 
(65) 
Next, using (65), we see that 
AM(n) < (logn)*t?, (k = 0,1,2,...), (66) 
by dint of 
ae +> Aa(n) 
AM = AW * Ay, where ; (s) =: > == (hs > 1), and )7 Ag(d) = (logn)’. 
7 
n=1 d\n 
(67) 
” y / , ps 
Note that by comparing coefficients in (5) — (¢ (s)) + (£0) we have 
A (n) = Ax(n) = An) logn + AP (n) = A(n) logn + A(a)A(=). 
d\n 
(68) 
It is easy to calculate that for a €¢ Z* and p a prime number, 
Ag (p") = (2a — 1) (log)? (69) 
a-1 
A® (p') = : _ ) (logp)*, (70) 


A® (pt) = >(.¢ ) - ¢ ')] (log p)**?, (71) 


Here we are employing a convention that for u, v integers which are > —1, 


1 if r=s, 
r 0 if s=-—landr>0, 
() a 0 if r<s, (72) 
r! ’ 
ay otherwise. 


Now we give two lemmas which will be used in Sect. 7. 
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Lemma 1 Let k € Z*. We have, forN = 1, 


AM (N — log log 3N + 7 
> AN Wn) K (log any log log 3N + Tk (73) 
- n (k—1)! 
n<> 
=2 
The same bound holds with A®(N + n) in place of A“ (N — n). Here 
k-1 
(Ky) =] [ K+. (74) 
j=0 


with the convention (K)o := 1. 
The k = 1 case of this lemma was shown by Gonek [13] within his proof of (25). 
Proof Let 

S(x) := | {N-x<n<N: A 40} | 


By (65) we can write 


A®(N =n) 2 dS(x) S(x) 


N 
nx z 


N N 
2 rm iL sa 
1— i= x2 


We now need an upper bound for S(x). To this end we quote the following result of 
Tudesq [22]: 


y \ x (loglog 3x + 6);-1 
ay , 
Picondite log 2x (j— 1)! 
@(n)=j 


where 1 < x < y, j € Z* and the constant implied by < is absolute. Employing 
this we have 


k 


k 

x (log log 3x + 6);-1 x (log log 3x + 7)x-1 

S(x) < 1 = ; 
@=), » 1« ere G—1! log2x (k—D)! 


j=l N-x<n<N j=l 
a(n)=j 


since 


k 
a= ee) (kEN, reR). (75) 
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This upper bound for S(x) gives the claimed result. The sum having A“ (N + n) in 
its summand can be treated in the same manner. Oo 
By the same line of reasoning we have 


Lemma 2 Letk € N. We have for N = 1 


A®(N —n) 47 (log log 3N + 7)x42 
a « (log 2N)*t? 7 
ge (k+ 1)! ve 


N 
nx 7 


The same bound holds with A (N +n) in place of 1 (N — n). 
The rest of this section will be used in Sect. 8. We define 


CO 


; , k " b 
3 Aieis _ (~ «) (So) ,, d=berl (77) 


n=1 


so that (cf. (62)-(63)) 
A(k,0;n) = A(n),  A(k, 1;n) = A" (n). (78) 


We are going to study the sum Y> Aki. 41,2) A (ke, 2,0). The result will be 
nsx 
Proposition 2, and its corollary will be needed in Sect.8. We begin with the 


following Lemmas. 


Lemma 3 For prime p andk € N, we have 
k(log p) A(k — 1, t;n) 


A(k,t;pn) = 4 +[: = 1](logp) [2A(k + 1,0;n) + (logp) Ak, 0;n)] if (p,n) = 1, 


O ((k + 1)(log p) (log pn)k+?*“!) if pln. 
(79) 


Proof The k = 0 case can be verified easily, so let k > 1. First take 1 = 0. For 
(p,m) = 1, by (78) and (70) we have 


min(a,k) 


A(k,0: pm) = AP(p'm) = YO (") ren arrem 


r=1 


*\ (k\ fa-1 rien 
>a | eae (log p)’ AN" (m). (80) 


r=1 
With a = | this reads, for (p,n) = 1, 


A(k, 0; pn) = A“ (pn) = K(log p) A“ (n) = k(log p) A(k — 1,0; n) 
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| r—1 
which is the statement of (79) for this case. Using (' ) < eon in (80), 
r— r—1)! 


by (65) we have 


k qa} 
A(k,0;p¢m) = A® (p'm) syi(h Ne Gap logy down) 


k 

k= 1 7 -)-G— 

< koe (|) oer ‘(log my@—P—D 
r=1 


= k(log p) (log p’m)". 


This completes the case 1 = 0 and p | n. 
Now we consider the case 1 = 1. For (p,n) = 1, we have 


A(k, 1; pn) = 2 (pn) => A® (=) As(d) 


d\pn 


= = 4%(F Ja. +e A; )A2 (pd) 


d\n 


In the last member of this equation, applying the already proved « = 0 case to A 
in the first sum, and using (68) in the second sum we obtain 


A(k,t:pn) = klogp) > A® ae ) Ar(d)+2(logp) >> sete )A@ 


d|n d\n 
+(logp)° AM (n), 


which gives the statement of (79) for this case. 
For the last case with p | n, letn = p'n’, (n’,p) = 1, r € Z*. Observe that 


A(k, 1; pn) = 2 (pn) = S> A® (=) aoa) 


d\pn 
n 
= DAG )A2 (d) + ee 5) Ax (pHa) + A®( 1) Ao(p" 1), 
d'\n! 
d’>1 


Using (65), (69), 


Ax(p"*!d’') = 2(log p) A(d’), (d’ > 1, (d',p) — 1) (81) 
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which follows from (68), and the case 1 = 0, we obtain 


A(k, 1; pn) < k(logp) (log pn)*“! S* A2(d) 


d\n 
+2(log p)(logn’) Y* A(d') + (2r + 1) (log p) (log n’)*. 
d'\n' 
Now the assertion in the case 1 = | and (p,n) > 1 is immediately seen. Oo 


We now begin examining the sum > A(k, t; bn), where b is either 1 or a prime 


x 
nS; 


number p < x (clearly, the sum is void if b > x), fork < 
A(k, t; bn) = 0). The results will form 


logx ;: logx 
jog? afk > joe?” then 


Lemma 4 Let b be 1 or any prime number p < x. For1+[b#1J<k+e< 5, 
there exists a constant Cy > 1 such that 
Cc x 
A(k, t; bn) < ———*——-~ (log x41, 82 
do AC ") < Ey apie Oe (82) 


x 
n< 5b 


Proof First we take b = 1. We have 


2 x(1 k+2i-1 
y A(k, t;n) ~ ee a (x > oo, any fixedk E N,v € {0, 1},k +1 > 1) 
t—1)! 


nsx 


(83) 


as a generalization of the prime number theorem, since 


a” k gH ts 2'x(log x)kt2“! 
rev-i| (-£e) (00) =} (kt 2-1! »(KEN,v € {0, 1},k +2 > 1) 
(84) 


(the details of the proof of (83) are similar to the zeta-function based proof of the 
prime number theorem as in [2]). However, we will also need to work with k not 
fixed. We want to show that 


C : 
> A(k, UF n) S eau ape 1 (85) 


n<x 


with some absolute constant C3 > 1 by doing induction on k. The induction basis 
cases are covered in (83). Starting from the definition in (77) we see that 
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S- Ak + Lun) = Yd 4(Ke “) A(d) = \ AW Y* AK ce) 


nSx nsx d\n dx e<4 


d 
Cy A(d) x\k+2I-1 
< ; 1 
= Ge aie d (108 5) 


d<x 
Gg" kt? 
< ——3___x(] ee 86 
= (e+ 21" og x) (86) 
where we have used 
A(n) x\i — (logxyt! 
= (log “) = Fa tO (does), GEN) (87) 


nsx 


which can be seen from Mertens’s classical result via partial summation (this result 
is included in Lemma 5.5 of [5]). This proves (85). 
Next, taking b = p a prime number, we examine the sum > A(k, 1; pn). Let £ 
nse 
be the unique integer such that x € [p’, p°*'). First we examine the cases k +1 = 1 
which turn out to be special. For (k,1) = (1,0), the sum is = A(pn). The non- 
n<* 


=p 


1 


zero summands are from n = 1,p,p?,...,p‘~!, and the sum has value £logp € 


G log x, log x], so that 


Y > ACL, 0; pn) = D> A(pn) < log x. (88) 


x x 
nx = nx 2 


For (k,1) = (0, 1), the sum is > A2(pn). The non-zero summands are from n = 


nss 
1,p,p’,...,p'! with total contribution (7 (log p)? € (+(log x)’, (log x)?], and from 


TFA) 


n= p'p\ <= with a prime pi # p and r > 0, r, > 1. By 


Ax(p}' ps’) = 2logpi logp2, (ai,a2 = 1), (89) 


which is seen from (68), and 


A(x):= S> logp<Cyx, (x>0) (90) 
psx 
p: prime 


132 Y. Karabulut and C.Y. Yildirim 


(with Cy = 1+ <a according to Dusart [3]), we have 


Y> Aa(p'*'p!') < 2logp > > YS og 


rill ex r>=0 r}>1 A 
ne es nis(- —)7 
< 2c.t0er 5 (Z5)" 
70 r1>1 
<Cs logy) a = < Co" logp, (91) 
r>0 


1 
where we have made use of (“ a ) " > 2 for non-void sums so that r; can be at most 
log 41 bata : 

i ae . We have chosen to use the majorization (90) because if we try to evaluate 
all of the sums asymptotically we will get quantities containing the factor log = and 
this will necessitate examining the results according to the size of p relative to x and 
thus further complicate the matter. So, we express the result for this case as 


) > AQ, 1: pn) = De Ax(pn) < ar log p + (log x). (92) 
nS; n<% 3 
We wish to obtain an inequality of the type (85) by induction on k + ¢, but from (88) 
and (92) We see that in the k+ 4 = 1 cases the power of log x won’t match the power 
in (85) if or = o(p), p < x. So, for the induction basis we take k + 1 = 2 which 
has the two cases (k, 4) = (2,0) and (k, 1) = (1, 1). 
For (k,t) = (2,0), the sum is » A (pn), differing from > Ax(pn) only in the 


n<* n< 
—P —~P 


contribution from pn = p, p*,...,p. This contribution, A) (log p)’, vanishes for 
p > ./x. Hence, from (92) we have 


Y > A(2,0; pn) = }> A (pn) < Cs log x. (93) 


n<* n<* 
=p P 


Now take (k,1) = (1,1) in which case the sum is >A (pn). The non-zero 
nss 
summands are from n = p,p?,...,p°—! with total eonwpanon HE DCE) dog py 
Pyll 2 


by (71), and from n = pp; (r = 0,7, = 1) andn = p'p|'py,(r = 0,171,172 = 1) 
where p, p; and p2 are distinct primes. We have, by (67), (69) and (89), 


AY (pty!) = (4r + 1) (log p)? (log pi) + (471 — 3)(ogpi)*(logp) (94) 
A (p'*! pip?) = O(log p) (log p1) (log p2). (95) 
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Then, estimating similarly to (91), we have 


SY. AMD pt) < ogp? S4r+.) >> SS logy 


p’py' <3 = = 7 
=p 
r>0,r1>1 ris( 47)" 


+(logp) )> 0 (4r1-3) 2 (ogi)? 


r>0 r1)>1 


< C,4(logp) YS \(4r + vd(; ay 


r>0 r>1 
BE Be 
r| x a 
+C,(logp) ¥* S(4r1 -9(F :) toe (=) 
r>0 r}>1 P 
< Co(log p)” Y4r+ D5 a 
r>0 
1 
4r,-3f[ x \" 
+Cio(log p) (log x) )> S* — ( =) 
r>0 rj >1 r P 
x 
< Ci log p)(logx), 
and 
PS AO (pitt pl?) < < 3(log p) > pa logpi > ~ log p2 
pip pe <* r20 zt raz 1 y 
120, r.n>I i <( 4)" : ns(aeyr) 
> 
<Cr(logp) ¥> SY) loge > (sa Fp =)" 


r>0 r2>1 


n= incon)! 
s Cisx(logp) > pri » > ees 


= a 1 Pi 


nis(ser) 
< Cy4x(log p) ye oF << , (log p1) are 


r>0 Wes meee 


x 
< tis, Nosy) een): 
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Putting together these calculations, we have for the case (k, 1) = (1, 1) 


y> AC, 1: pn) = D2 Dn) = Cr6= (log x)’. 


n<* n<* 
Pp 


basis for establishing 


n<* 
ae 


Ci, k+2i-1 
Y> Atk, tpn) < Ed Dip 12”) 


with some absolute constant Ci7 > 1. Letting pn = de, we have 


YAK + Lepn) = 4 (ke 4) s@ 


nst d|pn 
= Yo A@M)Y AK. 42) + 0 A®M YD AK ce) 
ds; e<i d<x ex<3 
(p=! ple pid 
= Yo Ad Yo Ak upe) + D> Ad’) = A(k, te). 
ds 2 es od d<* 3 esa 
(d,p)=1 


(98) 
The last two double sums can be majorized by using the induction hypothesis 
and (87). We have 


Y> AM Yo Atk, u pel 
d<* 


Cc A(d) x\k+2-1 
1 
L4 = GyuRDip ae d (loz 3) 
(ap)=1 er 
Cir 


Z x ( (log x)*?! 
~ (k+21-—1)!p 


k+2i-1 
+2 + Moe») )) 
and, since A(pd’) < A(d’) 


om A(pd') x 

l 
sae eae ae | 
Chr 


k+21-1 
tae bs 
=p 
- x (log ae 
~ (kK+21-—1)!p 


o(( k+2i-1 
(k + 20) deen) 
Using these in (98), the induction step is completed 
Next, we prove 


aor, pd’ 


dX A(pd’) pe A(k, te) < as 


oO 


(96) 
Formulas (93) and (96) are of the same type as (85), and this furnishes the induction 


(97) 
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logx 
log 2’ 


Proposition 2 For 0 < kj,ko < t,t2 =Oorl, kj +4, kk +b => 1, we have 


Sky korn) = D> Alki, un) Ak, 2:1) 


nsx 
P(k1, ka, 1, t2) 


~ (ky + ky + 2t; + 2. - prose 


Chtk, log x ky kg +20) +222 
iso : (99) 
(max {k,, k})! 
where 
ky! if gy =2=0,ki =k, 
2- (max{ky, k})! if q—-mb=+1,k,} -k = Fl, 
max{ky, k2})! if gq —-m=+1,k) -ky = F2, 
P(ky, ko, 1,02) 2= ae ») ; 
(ky +2)!4+4-(, + D!4+2-h! if gqeun=1lkh=h, 
2+ (max{k,, ko} + 1)! if qye=u=lky—-ky=+1, 
0 otherwise. 
(100) 


Proof We will do induction on k; + kz + t1 + l2, for which the least value is 2, and 
there are three cases of this to be considered for establishing the induction basis. 
We have 


S110) = )) An)? = xlogx + O(x), (101) 
S1.00.1&) = So..1.0(x) =) A(n)Ao(n) (102) 
= by [Aw logn + A(n) y A(aA(=)| = x(logx)* + O(xlog x), 
n<x d\n 


since the first term in brackets contributes x(log x)? + O(x log x), and in the second 
term only proper prime power values of n contribute, and this contribution is 


O(x2*¢). 
The final case for the induction basis is 


So.0.1.1) = DAM)? = YF Aa(n)? 


nsx n<x 


2 
=) sien? + 2A(n) logn )) A(d) A(“) se (X awa(')) } 
d\n 


n<x d\n 


136 Y. Karabulut and C.Y. Yildirim 


The first term in the summand leads to x(log x)? + O(x(log x)”). In the second term 
of the summand, only proper prime power n contribute, so that this term leads to 
O(x2**), The contribution of proper prime power n in the third term is bounded 
by the same error term. The third term is 0 if n has more than two distinct prime 
factors, so it remains to consider the contribution of n having exactly two distinct 
prime factors in the third term. The set of n = p,p2 < x, p; < p2 contributes to 
the third term (we need not exclude the cases p, = p2 since these give a negligible 
contribution, and we take all pairs of p; and p> and then halve the sum; we will use 
this setting also in other cases below) 


1 
5 D, Zlogpilogp2)’ = 2) | (logpi)” )) (log p2)” 


D2<X x x 
P\P2= PiS3 pose 


=2)° (logpi)? (= =e @ eS (;;)) 


PIs} 


1 2 
= 2xlogx Ss (og pi)” 
P\ 


PSs 


(log pi)” aa 
— 2x oa = : + O(x(logx)*) 


= 2xlogx (5 coe. + o(log»)) 
—2x (5008.9 +O (‘og 
= prlosay + O(x(logx)’). 


The third term also gets contributions from n = p|'p3.pi # p2.11 = 1,172 = 2, but 


do dE dogpy? 2 YY dogp2)? « SF 2 ven » (= y log x 


ry=lylle 2 >2 n21) r>2 
Pi <j a 
< 
mel)? 
Cogpi)* 
< x2 log x y y 
n21pi! <j pe 


iT 2, 
=x? log x + Moe ri: < x(logx)*. 
P\S} Pi 
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Hence 
a 3 2 
So0.11) = 37 Ulogx) + O(«log* x). (103) 


The formulas (101), (102) and (103) obey (99) and (the first, the second and the 
fourth cases, respectively, of) (100), and so the induction basis is established. 

In dealing with the cases k, + kp + 41 + l2 = 3, we will first consider the cases 
when one of k; and kp is 0, say k;} = 0 so that 4; = 1. There are four such cases: 
So,1,1,1(%), So,2,1,0(%), So,4,1,0 With kp > 3, and So4,,1,1(4) with ky > 2. 

We begin by considering 


Sona) = SAMA a) = Yo Ar(n)(A * Ar) (0). 


nsx nsx 


Here only those n which are either proper prime powers or which have exactly two 
distinct prime factors contribute. Using (68)-(71), the contribution of proper prime 
power n is 


32) P= Ya Ila ~ 1° ogp)> «Kx? 


prsx psx 


Using (94), the contribution of n = p;p2 terms is seen to be 


1 
2 » A (Pipr)A (Pipa) 


Pip2Sx 


1 
= 5 D> @logpi logps)| log p1 (log po)” + (log p1)* log po 


P\pr2Sx 


=2)) (losp.)*(— log = ¥ o( as )) 


x 
P\S3 


1 A I a 
= 2xlogx > sere — 2x y ee + O(x(log x)*) 


Piss Piss 
_i 4 3 
= grilles) + O(x(logx)”), 


As for the contribution of those n having two distinct prime factors of which at least 
one is a proper prime power, we have 
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SY AOD PRAY (Dp?) K Y= riogp1)7 (log p2)*+ r2(log p1)? (log p2)* 


Py! py Sx Pil py Sx 
rnz=lrm=2 ry>lr>2 
2 
«yn 2 (logpi)? D> a (log p2) 
ry>1 pi! r2>2 


+> Yo dogp)* Yor . (log p2)* 


21 plex 2 
n2T pl <F Lp eas mo<(r) - 
| 
1 ily 
2 x 2 
«yin 2 (logpi)? )> log | + 
nz1 pil< r>2 py PI 


4 1\ 2 
x \? 
+ > YS dogp) Dalz -)’ log (=| 
nzlpi<s r>2 Py 


< x2 (log x) » r| » (logpi)* 


nzl p<} Pe 


+ x3 (logx)? D> 4 2 (og pi)” << x(log x). 


rj>1 


HA 


Hence we obtain 


So111@) = x(1083)* + O(x(logx)*). (104) 


Next, using (68)—-(70) and (89), we consider 


So2,1,0) = D5 AO (MA (n) = YF Ar) AM (0). 


nsx nsx 


Only those n having at most two distinct prime factors contribute. We have 


D5 20") A (p') = D5 Gr — 1)(r— 1) dog p)? 


pl sx pl <x 


< (logx) x era) De log p 


r>2 p<xt 


< (log x) > rx? Kx? log x, 


r>2 
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and the contribution from 7 with exactly two distinct prime factors is 


oe Ao(pi'p3) A (pip?) = 2 > (log p1)*(log pr)” 


72 Tr r2 
oe "py Sv Py Py Sx 


a ee RERI Ds Y> dogp2)? 


>1 a5 
p< <3 MN? n<(2 x)? 


=2 D> (logp,) >» (= "te :((2)’) 


piss n=l 


+o((2)') 


1 

x 1fx\2 

=2 (log p1)? lo (=) ~(=) 
Ds spi) log pi es pi 


if tee 3 > 
Py S35 m2) 


=2 (og) toe ( -) 


x 
PY S39 


(= +0 (=) : + O(x(log x)?) 
Py Pi 


= = 2xlogx )> > fen 


ry>=1 


ps4) 
(log p1) 
Sy De 
r= caitt , ted 
pis(3)" 


+ of! a wen) + O(x(log x)*) 


2 
pisk Pi 


2 
= 2x(log o( 4 on) 
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- ( Coes? 4 ou) + O(x(logx)”) 


1 
= zillog.x)" + O(x(logx)’). 
Hence we have 


So2,10(%) = prog." + O(x(log x)’). (105) 


Now consider Sox10(x) = SSA? MAP(H) = YAMA), (k = 3). 
n<x nsx 

Because of the presence of A(n) only those n which have at most two distinct 

prime factors contribute to the sum. Only those prime powers n = p’ with r > k 

will contribute since A“ (p") = 0 for r < k. We see, by (69) and (70), 


Y> A2(p')A(p') = Yr - n((- ) (log p)**? 


psx rk 


rok PSxr 


wo ae wie 
< YS @r- 1) is 1) (=) (log x)**! > logp 


r>k al 
~ psxr 


< Dos ax ? (log x)k+! < at (logx)**!, 


r>k 


In calculating the contribution of n with exactly two distinct prime factors we use 


k-1 
k\[m-1 r2—-1 F a 
Wl. ; (log pi) (log p2)"/ for ry + r2 > k 
a eau ery 


0 for r} +12 <k, 
(106) 


and without loss of generality we take r; > a We have 


p> Ag(pi' pz) A™ (pip? 


Pi! Px <x 


k-1 
k 7 r| 1 2 ay — 
eel) ot. a) YS dog pit! log po) IF! 
mil NIT LYNE) nD 
rj >max(j,5) Py Py SX 
rg =k—j 
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k~l k io} a 
<>» () x ei D3 dogpiyt! 2 (ogp2)* i+! 
j=l 


G-DEk-j 


ry=max(j.5) pi S3 72 
1 
fi Pat Ly ki 1 
a 1 x \72 x \2 
< (log pi! log (=) (=) 
(i) x a k-j-D! a ( pi Py 
ry =max(j, £ fi 
m>k 
k i 1 x 2 
- >( ) = a? 72 tospi) (ls (Fr 7)) as (Fr) 
j=1 J a 5V Pi 7 Py r2=k-j ? J * \P 


= , 1 

k rw x \\U(*«\F 

<> » (logp,yt! (io (=)) (=) 

(a «kj! ,c a et A 
os 5 


pale 


— [k A = 
«it (\oo Pe Oe (oer (+) 


j=1 " ax(j,. & salt 
J r1}>max(j, 5) pi<(4)71 


2 aa 
co pa Dy, Moem) >) () (=) | 


n> k > <G )7r a jx&min(r) .k—1) 


= Cfox(log x)! 
<__max 0 Ds Ss (ospi) er an 


1<j<k-1 
NES gels s)7r 


since the sum over r; starts from r; > LS) > 2. Combining these, we obtain 


C2 x(log x)” 
So4,.1.0(%) = 0 (Gate we) } (ko = 3). (107) 
om 


By (66), this also settles 


Sodora@) = YA (MAP M) = Yo Ara (n) «K (log.x)? }* Ar(n) A (n) 


n<x nsx n<x 
C2 l ky +2 
=o (see , 23), (108) 


and there remains the case 


So211@) = 5A (MAP (A) = YF Aa(m)A(n). 


nsx nsx 
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Only those n with at most two distinct prime factors contribute to the sum. By (69) 
and (71), the prime powers contribute 


> A274 (7") = D> 2r— »((; - @ ') Jase’ « Yor dlogp)® 


pl <x pl <x > 1 
ae a le? peer 


< \ (log (x? xT ) log p  (logx)° eee < x3 (log x). 


r3 p<xt r>3 


To estimate the contribution from those n = p''p; pi # po, first note 


A (pp?) = (ri — I(r — 2) (log pi)? og p2) + (72 = 1)(r2 — 2) (log p1) (log p2)* 
+ (8rir2 — 3r) — 3r2 — 2) (log p))? (log p2)”, (109) 


as can be found from (67)-(71), so that 


pa Aa(p' pZ)A (pip?) 
Pi Py sx 
«K D> (n= D1 = 2)ogp,)*(log pr)” + (r2 = 1)(72 — 2) (log pi) (log p2)* 
Py py <x 


+ (8rr2 — 3r, — 3r - 2) (log p;)° (log p>)’. 


The first term here contributes 


«Yori >) dogpi)* > > ogp)? 
ne? piles mel 7 vd 
ns(3r)’ 


1 


« dort dS dogp.)4 Yt (= -)" Y>  logpr 


ry>2 pis r=) 


bar 
< (log x) > et (log p1)* yo“ Ur ) <« x(log x) si ry ps fee 
ry>2 pi<% r2>1 ry>2 


1=2 pis yt 


Ay2 
< x(logx) Yo rj (108 (3)" ) 3 =P &xtloex) we =e leg? 


ry22 I rz) Pl 


pris(3)"™ 


< x(log x)". 
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The treatment of the second term is the same. The third term contributes (since for 
non-zero contribution we need r; + r2 > 3, we can take r; > 2,12 > 1) 


< > > (log p1)° YS 8rin — 3r,; — 3r2 — 2) 2 (log p2)* 


22 pi <i r2>1 a 
n=(r) 
8rir2 — 3r) — 3r.—2 
< (log x)? > s (log pi)° >. =e 5 . > log p2 
r22pil<k n> "5 de 
n=(r) 
1 
8ryiro — 3r, —3r.-2 [ x \r 
K (logx)?? » (logp:)? ) > —=——=> ( =) 
nN2pl< r2>1 ry Py 
lo 
« x(log x)” Yon > (log pi)” Be 
n=! m<(4) 
1 2 
2 X\ 7 <r 4 EPI 
< x(log x) vn (toe (5) > < x(log x) Doe ~~ 
ry>2 2a nz2 Pl 
pis(3)” 
< x(logx)*. 
Hence we have 
So21,100) & x(log.x)*. (110) 


All of the formulas (104), (105), (107), (108) and (110) conform to the claim of 
Proposition 2, so the treatment of the cases k, + ky + ly + ty = 3 with one of k; and 
ky being 0 is completed. 

From now on we take k, + ky + ¢) + lo => 3 with 1 < k; < ky. Since 


Alka, lan) = A(k—1,4;—), 
(ka tain) = Y(logp)A( ko — 1.0: =) 


p|n 


where p denotes a prime, writing n = ep’, we see 


Skxtour®) = (logp) > A(ki.t1:ep")A(ko — 1,125) 


psx eS ir 


= SJ (log p) ¥> Alki, 41: ep) Alke = 1, 2: ) 


<x e<i 
a= —P 
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x 
e<Hr 
r>2 
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+ J (logp) ¥> Aki, u13 ep") Ake — 1, 2: ) 
psx 


e<* 
=p 


= ) (log p) }> Aki. 11: ep) Ak = 1,12: e) + Be, say, 
psx 


e<t 


= J \dogp) S) Alki, 41: ep) Ake — 1, 12:8) 
psx 


(e.p)=1 


e<t 


+ “(ogp) D> Alki.usep)A(ke — 1, 0:e) + 3s 
psx : 


(e.p)=p 


e<i 


= )dogp) S*> Alki,1:ep)A(ko — 1,ta:e) + Ey + Up, say, 
psx 
P 

(e.p)=1 


Now we use Lemma 3 for A(k;, (1; ep), and have 


(111) 
YS \dogp) SY) Alki.u1; ep) A(ko = 1, ue) 
psx : 


e<* 


=k) (ogp) D> Aki - 1 usep) Ale - Lue) 
psx 5 
noi 


e<i 


+ 2[ = 1] )“dogp)? Y> Alki + 1,0; 2) Aly — 1,02: e) 
psx E 


(e.p)=1 


e<t 


+[u =1] > dogp)? S> Alki, 0;e)A(ko = 1, 2:2) 
psx 
(e,p)=1 


=k, ) (logp)? }° A(ky — 1,415 €) Ake = 1, 125) 
psx 


x 
e< ? 


x 
e< Fa 


+ 2[1 = 1] )“dogp)? S° A + 1,0; e)A(ko = 1, xe) 
psx 


e< 
=P 


+ [u = 1} dogp)? S° Alki, 0; e)A(ka = 1, 252) 
px 
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—k, Y(ogp)? > Alki = lure) Ake — Lease) 
psx 


x 
esp 


—2[u = 1] )dogp)? > Alki + 1,0;e)Alky — 1, 12:8) 
psx 


(e.p)=p 


ext 


— [ur = 1) dogp)? S$) Alki, 0; e)A(ky = 1,123) 
psx 


(e.p)=p 
= d+ 2 4+ Y3 — Ly — Ys — Ye, say, (112) 
so that 
Sky katy to (0) = Ly + Ly + Y3 — Ly — V5 — Ve + L7 + Ly. (113) 
Applying the induction hypothesis to %', 2 and 3 we have 
31 =k) (logp)” D> Aki — 1.3 €) Ale — 1,05) 
psx ext 


ky P(ky = 1, ky = 1, ¢1, ¢2) 


l 2 ky tke +2, +2t2—-3 
: s (log p) (108 =) 
(ky + ky + 2t, + 2. — 3)! a 2 P 
of 208°" 57 ose) 
max(k; — 1, kp — 1)! 


x ky +ko +20) +212—4 
(toe ) ; 
psx P 


P 
Dy = 2 = 1d dogp)? SY Ak + 1,0;e)Alk = 1, 1956) 
psx 


P(ki + 1,k2 — 1,0, 
= 2[e, = 1]x Vig be 42) 


(log p)? x ky +kp +21; +2t2—3 
log ) 
Git +2 29-3 2) ( 
Chithe 
+O (« = 1] weg 


P 
1 2 ky +kp +201 +2t2—4 
> (log p) (toe =) 
pox ?P P 
33 = [4 = 1] ) ogy)? }> Alki, 0; €) Alka — 1, 12:8) 
psx 


max(k; + 1,k2 —1)! 


ext 
SG siG P(ky, ky — 1,0, t2) 
1 


(ky + ky + 214 + 2y = 4)! 


xe (log p)? ( 


5c ky +kp +20, +2t2—4 
log ) 
pax =P P 
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4 A f _ 1 ort y (log p)? log x ky ky +20; +2t2—5 
; max(k,, k2 — 1)! D 


pax P 
(114) 


We must mention that some care is needed in ensuring the applicability of the 
induction hypothesis. In these applications we need to check whether the hypothesis 
of Proposition 2 is satisfied. Specifically, for X’; we need to have k} —1+ 4; = 
1, ky —1+l) > 1. If these are not satisfied, then either k, + ty = 1 ork; +; = 1. 
Recall that we are working under the condition ky +ky)+, +l. => 3 with 1 < ky < kp. 
If ky + lo = 1, then ky = 1, t) = 0, and therefore k,; = 1, 4; = 1. So we encounter 


Sii10%) = SOAP (AM) = YO(r— 1)? YO Clogp)* 


n<x r>2 al 
i psxr 


< Yie- 1)°(log(x y) me logp « (logx)? y- < x2 (log x)3, 


r>2 p<xt r>2 


(115) 


which is compatible with what (99)-(100) written for (k,, ko, t1,l2) = (1,1, 1,0) 
gives, namely 


Si1a0(x) = 0 + O(Cigx(log x)’). 


If kj +e; = 1, then ky = 1,4; = Oandk, + ly > 2. If ky = 1, then i. = | too, 
and we encounter Sj 1.0,1(x) which is the same as Sj,1,1,9(x) just settled in (115). If 
ky > 2, then t2 = 0 or tz = 1, and we encounter Sj 4, .0,0(%) or S1.45,0,1(x). Now, with 


k > 2, we have 


Siz0.00) = )) AMA (n) = > ((- ) Y= dog p)*t" (116) 


k-1 
nsx r>k 1 
_ psxr 


= 1 1 k 
2) (,= 1) (ae)! y- (logp) « we vee g Toes , 


>k 1 
rek psxr 


and this is compatible with 


CH dog x)! 
Si x00) =0+0 (aecer , (k= 2) 


which is (99)-(100) written for (ky, kz, 4, '2) = (1,k, 0, 0). Similarly, for k > 2, 
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Si2010) = AMM = DO ¢ ' “= © ttogpy 


n<x r>k+1 p<xt 


r—1\2r—k-1 1. \k+2 
< a \ ORI) (log p) 
2. ( k k + ! ( ) 2 


<« 


(log x)*t? 3 1 ig xFH (log x)? (117) 
(K+ DE oe re (k+2! 


which is compatible with 


CH (log xk! 
S1%01) =0+0 (see) » (k= 2) 


obtained by writing (99)-(100) for (k1, ko, 1, l2) = (1,k,0, 1). An examination of 
2 and &3 shows that the only new situation to check is for X17 with ky = 1,l2 = 
0, in which case kj = 1, and the sum to be checked is S2.0.00(x). But, trivially 
S2.0,0,.0(x) = 0, and this is compatible with S2.o,.0.0(%) = 0+ O(Cigx) which is (99)— 
(100) written with (k,,k2,t1,l2) = (2,0,0,0). Hence the expressions in (114) are 
completely justified, and in the few cases where the exponents of log * in the sums 
in the error terms may be negative the statement of Proposition 2 has been verified 
independently. 
We evaluate the inner sums (over p) in (114) using a variant of (87), 


(log p)* x\ _ (k- 1! pes (k—1)M! as 
> (ios *) = ETD! (log x) +0( 2 cogs ) 


psx 


(118) 
for k > 1,£ > 0, which is Lemma 5.4 of [5], and for X, + 2 + 2X3 we use 


ky P(k _ 1, ko _ 1,01, ¢2) + 2[t4 = (Pi + 1, ko = 1,0, 12) + P(ki, ko = 1,0, 12)) 
= P(ky, kz, 41, t2) (119) 


which can be verified from the definition of P(k,, ko, t1, '2). So we obtain 


Dy + Ly + LY 


P(k,, kz, 1, t2) 5421 420o— kp 
aE es Ca Ce 


148 Y. Karabulut and C.Y. Yildirim 


Lp Cit x (log xy thet2nt2n-2 ae ky 
(ko + 1)! logx 


_ P(k1, ko, 1, t2) 
(ky + kp + 2u + 2lo = 1)! 


Cat x(log x)" ky +20) +2122 
O 
” (max{k,, k})! 


x(log x)" +ko +2; +2t2-1 


(120) 
Here, in writing the middle member of the formula, we have invoked kz > k; which 


was taken as a convention, and in passing to the last line we have used 


ky P(ky, ko, ty, t2) ky + 1)! 
(+244 25—1l Oe)! 


in case the left-hand side is not 0, which is seen from (100). 
It remains to show that ¥74,..., 273 which were defined in (111) and (112) can all 


be included in the error term of (99). In X74 letting e = e’p, and then using Lemmas 3 
and 4, we have 


dg = ky Y “(dog p)? > A(ky — 1,41; e)A(k2 — 1, t23 e) 


psx ext 
(e.p)=p 
« k(logx)"74"~? S “dogp)? D2 Alko - 1, us pe’) 
psx <4 
~pP 
Crt k (log p)? 
l t+ko+2t)+2i2—4 
Seer s 2 


psx 
Cit” x(log xy kg +20) +2t2—-2 


< iol 


(121) 


provided that Cjg is chosen sufficiently large. It is easily seen that 2’; — d's are 
handled similarly. This completes the proof of Proposition 2. Oo 


Applying summation by parts to the result of Proposition 2, we immediately 
obtain 


Proposition 3 


P(ky, ko, U1, bo) x7 (1 ky +ko+2u+200.-1 
S> Alki. 4132) A (ko, ta: n)n = (ki, ka, 41, t2)2* (log x) 
n<x 2(ky + ky + 21 + 21 _ 1)! 
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Cte? (log x) +kg +20) +2t2—-2 
O ; 
+ (max{k,, k2})! 
(122) 


ys A(ky oly nA(é, 95 n) _ Pk Fi ko, ly, tn) (log x) tt +22-1 
n 2x2 (ky + ko + 2, + 2 — 1)! 


n>x 


Chit ] ky Akg +20; +2t2—2 
+o ( Gs_Coe») _ (123) 
x?(max{k,, ky})! 


7 Correlation of Zeta Zeros with the Relative Maxima 
of |5(4 + i)| 


Assuming RH, let 9 = 5 +iv, v > 0, run through the non-trivial zeros of Z)(s). For 
correlations of zeta zeros with the relative maxima of the Riemann zeta-function on 
the critical line we set out from (22) by writing 


4xi(v-v) 


124 
dX d 4+ (y—vyP = 
Fcusr 
=x? S° A(n)n? >» n&—x ° A(n)\n? > n ° 
nsx T<u<T Hox T<v<T 
T log’ T 1 log T 
08 o(—— = SE | + 067? log 7) 
An x2 log T 


for x > 1 and T > To. We see that we need to calculate 


Y> n® for n =p", p: prime, a € Z*. (125) 
T<u<T 
Just as in the pair correlation of zeta zeros we do not hope to get an asymptotic result 
for x > T, so that if we assume x < 7, then for the tail of the sum over n > x 
in (124) we use 


x Atay YO nv? KPT logT ee de eT!" logT. (126) 
n>TM T<vu<T n>T™™ n 


We will see towards the end of this calculation that any fixed M > 4 can be 
employed. Upon this simplification we write 
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Ayi(v—-v) 


a> eae (127) 


Y T<usT 


= =x? YP A(n)r? > ne?—x > A(n)n? = n° 


n<x Teu<T x<n<TM Leu<T 


, Dios T 140 1 +0 x2 logT re, log T 
Amx? log T T eS 


271-3 
+ O(xT ? logT 


forl <x <7” and T > Tp. 
We are now going to calculate, more generally than (125), the expression 


/ —k 
log T 
n?(~ (9 +b) for 2<n<T™,0<k<|—"_|,b=-20r2 
Q 
x log log T 


vy 
7 <UST 


(128) 


(where n and k are integers) for later use in Sect. 8; the special case of n a prime 
power and k = 0 is (125). The result will be formula (149) in Proposition 4 below. 
To calculate (128) we will use 


—k —k 
_ (x <= Zi x _s 
Dn *(Le+n) = ah 20 (Lo+) nds 


3 <US<T 


+0 (1? (log Ty) (129) 


where the contour @ is the counterclockwise traversed rectangle having vertices at 
—6 + it,,1+6-+ it,, (r = 1,2), with 


3 t= a + O(1), (r= 1,2), (130) 


~ log T’ 


chosen such that the horizontal sides of @ are a distance >> _ away from the zeros 
of Z)(s). The error term in (129) is the effect of such a choice estimated trivially. 
Just as the estimate for © (8) (see Chap. 17 of [2]), from an argument based upon 


the representation of F(s) as a sum of partial fractions (which is similar to that for 


y 
+ (9) since the count of zeros is almost the same) we see 


Z 
Z@ + it.) = O(log?T), (—l<o <2), (131) 
1 


Analogues of Pair Correlation of Zeta Zeros 151 


so that the integrals along the horizontal sides of @ satisfy 


1+é6+t, 7 ! —k é 2-k 2-k 
'Z n° (log T log T 
/ —(s) Xs +b) nds < (log 7) « | BT) . (132) 
s+, ZI Xx logn logn 


The integral along the right vertical side of @ becomes, upon using (64) and (16), 
1 I+é+in 7! ’ ak 
FL es (Lo+ ») n-* ds (133) 
x 


2m Ji4s+in Zi 


logT 
L fogtogT J 


_ G1)F 3 ot oo A“+))(m) ty (mn) 
2n £=0 m=2 (mn)'*° 4 (log x) 
L logT | 
ee 3 att AO(m) f? mn" 
on = a (mn)'+6 t (I g yer 


- ai F(1+6 + it) a 
nJy t (log =<)" 


8 


(log T)~* C22 log T log log log T 
+O exp : 
n log log T 


By (58)-(60), 


Z 
Fa +65+ it)| <loglog|t|, (|t| > to), (134) 
1 
. ee k log log T (log T)~* 
so the first error term in (133) is trivially << . The 
n(log T)*+! n 
integrals here are handled by the following lemma. 
Lemma 5 For A large, 
1 . 
[ wit dt = 0 (in meat) oe ie 1, (135) 
be B-A kA se 
A (log x) i ay +0 aad) ifw = l, 


where A < B < 2A,w > 0,k € N, k = o(logA). The constants implied in the 
O-terms are absolute. 


Proof We recall Lemma 4.3 of [21]: Let F(x), G(x) be real functions, G(x)/F’(x) 
monotonic and |F’(x)/G(x)| => m > 0 throughout the interval [a, b]. Then 


4 
<—, 
~ m 


B 
| / G(xje! dx 
A 
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Using this, if w 4 1, we have 


ea ae 4 
Ww oe < aa 
A (log +) | log w| (log 4) 
By Bernoulli’s inequality, we have 
1 1 1 1 


A me z= k log 2x « ] Ay’ 
(log =) (log ay* (1 — 2822) (log A) (1 -Kez) (log A) 


logA 


since k = o(log A), which completes the proof of the case w # 1. 
If w = 1, then integration by parts gives 


2 a dt 
+k f ae 
, a (iba lak 


In 


ie dt > 
A (log£)° (log £)* 


_ BA (22-2, 
(log#)" (log A)’ (log Ay) 


To finish the 


: : kA kA 
Since A < B < 2A, the last error term is < (og A FI <K tenet: 


by the mean value 


: (a | k 
proof we simply observe that (oz EY (oe BY +O ( ist) 


theorem. Oo 
By Lemma 5, and 


Q) — yj! 
=~ — (s > 1; 7 €Z*) (136) 


employed in (62), we see that 


log 
L log log Fl 


yo 2 


£=0 m=2 


ro A+) (m) ty (mn)~“* 


dt 
(mn)!+6 t (log a 


t 
Qn 
L log T | 

log log T oe) A+) (m) 1 


ey 
£=0 


= (mn)'!*8 (log mn) (log T)‘+é 


logT 
L log log Fl 


(log T)~* 
n'+5 logn 


2 \$ SO AG (m) 
«K ( ) 


1+65 
log T «Om 
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L log T | 


(log T)-* log log T 


2 £ t? +1 
(148 
X (ier) Ce0+9) 


LngtesT 
e (los Tj =. Cuan Z (log T)!~* re C23 log T 
énlogn 6 log T nlogn loglogT )° 


nlogn 


(137) 


A similar calculation gives that the same kind of bound holds for the part involving 
A in the right-hand side of (133). The bound in (137) is smaller than the last error 
term of (133), so we obtain 


1 pitétin zg! 


, —k 
— el o£ (s + ») n*ds< 
2mi Ji+s+iry ZA x 


(log T)~* C4 log T log log log T 
exp ‘ 
n log log T 
(138) 


It remains to calculate the integral along the left vertical side of @. In this integral 
we replace s by 1 — s, using 


Zi = Hy - Za 
Zz, = FO-Zd-9 (139) 


which follows from (56), so that we have 


—b+it Z! ! —k 146—it, 57 . aly 
/ Fs)( + ») nds = -| £9 (Xa —s+ ») n!ds 
- 1 


Sti ZI +5-in X 
146—it Z! y' —k 
+f —(s) (La-s+n) nds. 
146-in ZI x 
(140) 
Integrating by parts and then using the estimates (16) and (17) we see that 
1+6—it) 7 ! —k 
i “9(Ha-s+0)) n>! ds 
14+6-in X x 
! ! Tk ysl 146—it 
n 1 
= (5) (Za -s+) ~_| 
Xx x log n1+6-it2 
1 1+6—it ly ! —k 
— i (4) (s) (Za -s+) n—! ds 
logn Jits-in \X x 
1 1+6—-it) ./ d / —k iT T 1-k 
= if Le es (Za ere ») irigge SED. 
logn Jits-in X ds x logn 


(141) 
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Next, letting s = 1 + 6 — it, and using (134), (64) and (16), we have 


1 1+6—it; Zi / —k 
— —! (5) past) n’| ds 
2mi Ji+3—in Zi 


1 1+é5— if Z / —k loslosT 
— —()(La—s+o)) nt 'ds+o( 2-8 
2mi Ji4s—ir Zi Xx (log T) 


log 
(—1)kn? oar © AE+D (mn) ce dt 
r (1 


on m'+é6 1 \k+e 
L=0 m=2 +) 


Lis log J 
n i 1kn 8 sie a 3 rE )(m) fe )" dt 
Ts mF fr \n (I eee 


og 5 


II 


n 


VA t \—k-1 dt 
Olk 4-6 = 7)) (lee — = 
+ («J z, | + it) (log —) ; 
Cos log T log log log T 
O| Gos 17 bap | ee ee a (142) 
log log T 


Putting together our findings as of (129), we can write 


log T 
(—1)*n? ‘geet ACV m) 7 pmyit dt 
oe (7) 
2 mits a (og 2" 
2n 


ree) 


l=0 m=2 


Lister! 
4 ph pe geri 5 AOGm) fp? (2 
on m!+8 t \n (log aire 


m=2 


(Cag log T log log log T 
O Cc T)- exp ( ak alae )). (143) 
log log T 


We split the sums over m into three parts: the m = n term, the terms with 0 < 
|m —n| < 5, the terms with |m—n| > 5. The m = n term gives 


Leger! 
—~jk pr eee’ 220M AGtTD 
( ~ y 2( - rr) (144) 
T 
0 


onn (log r= ae (log = 
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and it will be seen that this is the main term. By Lemma 5, the contribution from the 
m # n terms to (143) is 


Ligier! 
2 a ai 2 L A+) (m) 4 mar (m) aoe 
(log T)* i ‘viser m!*9| log 2 
Form < 5 andm => 2 we have | log “| > 1, so estimating similarly to (137), the 
contribution from the terms with |m —n| > 5 is seen to be 

C27 log T 
& (log T)~* exp ( 27-28 |, (146) 

log log T 


For the terms with 0 < |m—n| < 5 we use |log “| > in—m , so the contribution of 
these terms to (143) is 
liter (+1) 2 40) 
log log T £ A A m 
2 (m) + jaz AO? (m) 
log T)* = 147 
ssyef) ps (a7) |m—n| on 
£=0 0<|m—n| <5 


Estimating the inner sum by Lemmas | and 2, we see that (147) is majorized by 


Lngios! 
a rk ; 2 log 2n ort (log T)(log log 3n + 7)¢4 1 | (log 2n) (log log 3n + 7)¢42 
oe = log T £! (€+ 1)! 
logT 
log T fgioe? J (log log 3n + 9) 
loo T)!-* Cie MEO ee 
ibe ee ( a log log T dX ral 


(og log 3n + 10), logT 


oe log T ioglosT 
< (log T) ‘exp (cn; 7 7) ma a 
08 108 LiogtogT! 
L logT | 
log T —- log log 3n + 9 
= (log T)* C29 ———— 1 1+ ——+]— 
(log T) exp ( eet) em = oz ( a j ) 


j= log log 3n] +10 
7 Tl (1+ SEPE™=*) 
1<j<|log log 3n] +9 J 
Lingiogr | 
log T 1 
< (log T)* exp (cs etaT ) exp | (log log 3” + 9) > = 


j= log log 3n] +10 J 


x (log log 3n + 10) los !og3nJ+9 


log T 
< (log T)~* exp ( C39 —2— J, (148) 
log log T 


where we have used (75) and that n < 7™. 
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Putting the estimates of (144)-(148) in (143) we obtain 


Proposition 4 Assume RH. Suppose that T > To, 2 < n < T™, where we will take 
M > 1 to be a suitable fixed number. Letk ¢e N,O<k < Le | and b = —2 or 


2. Leto = 5 + iv run through the zeros of Z\(s). We have 


Lister 
3 0 (£( +o) a Ss 2! 2A (n) AEF) (n) , 
n ami = — = 
x° 2an Jt (log Ly oF! (log LH 


f<v<T 2 f=0 


log Tlogloglog T 
+.0{ dogT)-texp ( GrlosFloslosiosT\) = aga) 
log log T 


From this result we deduce the estimate for (125) to be used in this section. 


Corollary 1 Under the conditions of Proposition 4, with 2 < p* < T™, where p is 
a prime number anda € Z*, we have 


4 Tlogp 2logp\" 1 
dy ON = (1+ =a) -2)-(1+0(c)) 


f<vusr 
C32 log T log log log T 
+0(cxo/ eae )). (150) 
log log T 


Proof We write (149) with k = 0. By (70)-(72), the last term of the summation 


is with € = min(a— 1, Lister 71). Then we rearrange the terms of the summation 


with respect to the power of log ree Using the values in (70) and (71), in the case 


< < Lair | + 1 the main term coming from (149) reduces to 


from which it is straightforward to get to (150). The case a > Lieber val + 1 differs 
from the previous case within an error 


T logp a-1 2logp a 2logp 
< a ye (";') (Ze) +2(.2,) (BE 
P log T 08 a5 O8 a5 
£=LiogiogT 
een +1 
a— (o) 
+ BP 
L log 5— 


Z T logp 5 a (= ese) 
pt Be rd log T , 
t= Liog joe Td 
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tal 
of Stirling’s formula, the last quantity can be simplified to 


+1 
Employing the inequality ¢ “ , < (5) , which is a simple consequence 


a-1 £ a-l £ 
T log p“ C34a lo T log p* C 
= Sto 
P log T e log r P log T e 
t= Licgioe? J = ipeioer | 
T logp* ( C36 log log T Bogor? = C37 log T log log log T 
<K <«K p ~ exp , 
pe log T log log T 
and this is contained in an error term of the type as in (150) (if necessary by replacing 
C32 by a larger constant). oO 


We now resume the calculation for the correlation of zeta zeros with the zeta 
maxima on the critical line starting with using (150) in (127). From the sum over 
n < x we have 


1 
=x? S* A(n)n? > ne= a) yt - “ait (1 a o(=7)) 


nsx t <u<T psx 


21 
= © pttlozp)? (1 at wer (151) 
psx | ogT 

a>2 


1 C38 log T log log log T 
, = exp ( 3g log T log log log )Xaom?), 
x 


log log T = 


and from the sum over x < n < T™ we have 


a (log p)” 2log p 1 
-- yon 563 = ( 28 (1 +-0( 22) 


x<n<TM g T eu<T 7 cpeT™ P 


(log p)* 2logp ¢ 
O| Te 1 152 
+ # dL p + log T ( ) 
BPS 
+0] x ex C3g log T log log log T A(n) 
log log T nm f 
x<n<IM 


Now we use some consequences of the prime number theorem obtained by partial 
summation. The contribution of the last error terms of (151) and (152) is 


Cyg log T log log log T 
« xexp( noe a oad ae ). 


153 
log log T ares) 
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For the main terms of (151) and (152) we use the following which are obtained 
from (27) by partial summation: 


2] _ 3 
Y° plog” p =e — — - + O(x? log? 2x) 


psx 


2] 2 2] 2; . 
¥plog’p = 2 * —* +o + O(x? log! 2x) 


= 2 2 
log’p  logx 1 log? 2x log T 
= — O O 154 
Dp ae ae Og Ol ee (154) 
x<p<T™ 
logsp  log?x  logx 1 log’ 2x log? T 
= O O 
mo Pp 2x2 + x2 +r 4x2 +r + ~ [2M 
x<p< 


Then, the main term of the sums on the right-hand side of (127) is seen to be 


ne 2log?x 1 logx Hee, 1 (155) 
— (@) _— —_— _— 
An = log T 2 logT log T 


+ O(Tx7? log? 2x) + O07? TT!" log T). 


Just by comparing (153) and (155) we see that we are forced to put a restriction such 
as x < T!~ in order to have an asymptotic estimate. It remains to bound the first 
error terms in (151) and (152). We have 


(14 2BRY" A am, 
log T i 


it 
Xa 3 
“(log p)? < log(x« l 1 — < x? log 2x, 
do P(logp)” =) xlog(x+) D | logp < xlogx ) | — x? log 2x 


pisx a>2 1 a>2 
a>2 psxa 


1 1 
(log p)? log(x@) logp | logx wvxa log x 
Ds oi «> om > @ <B «=. 456) 
* 1 


2 
x<pt<™ a>2 a>2 x 
a>2 


p2xa 
so that the first error terms in (151) and (152) are 


< Tx”? log 2x. (157) 
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Combining (127), (153), (155), (157), and taking a fixed M > =, we have 


4xi(7—v) 
eB 2» tae (158) 


<vu<T 


_ Tlogx 1 2logx +0(T) + T log? T 140 1 
Ag log T Anx? log T 
for 1 <x < T!<asT > oo. 


Now we write the version of (158) for each case with rast <u< ne k= 


0,1, ee id 
log" T 
estimate 
> > gs pee” (159) 
-lo : 
Y O<v< T ‘ 7 ( ~ v)? log* T - log? T 
ie | 7 i 


Then, in the same way as for (24), we discard the terms with y ¢ (0, 7] within an 
error of O(log? T). Thus we obtain 


Theorem 1 Assume RH. Let 5 + iy and 5 + iv run through the critical zeros of 
f(s) and of Z,(s), respectively. We have, uniformly for 1 < x < T'~, 


4xi(7-v) 


Prey) = > > tia (160) 


0<y<T 0<v<T 


T logx 2logx T log? T 1 
= 1- 1+ 0| — O(T), 
2 ( log T ) = Qn x? a log T 7 


as T — , in which case Fy z, (x, T) is asymptotically real. 


Remark Since the main term of F¢z, (x, T) has a sign change in 1 < x < T!~£, we 
can’t deduce conclusions as Montgomery did using the positivity of F¢¢(x, T). 
8 Pair Correlation of Zeros of Z;(s) 


We shall first derive an explicit formula involving the zeros of Z,(s), an analogue 
of (19), which relates a sum over the non-trivial zeros 9 = 5 + iv of Z;(s) toa sum 
over involving arithmetical functions. We begin by considering the integral 


Z 1 
f — (w+ ~)K(w,s)x"” dw, (x> 1), (161) 
RR Z\ 2 
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along the rectangular contour & with vertices atc + iY, —U + +iY, with 


1 
c==+6€, (€ > 0can be arbitrarily small but fixed), 


2 
Y 
U=2k+1, (k: large natural number), a > 5 > || 
3 11 
w=utiy,u,yER, ve[54 (162) 


(the restriction on o is just for convenience, we can take o > | + €; later on we will 
use our explicit formula with o = >), and 


CSG) ee 2) 


The kernel K(w, s) was introduced by Farmer et al. in [5] in their derivation of an 
explicit formula involving the zeros of &’(s), where &(s) = oD 7-2 I'(5)é(s). The 
integrand is regular on the contour of integration. By (57) the left vertical side of Z 
is at a distance > } from any zero of Z,(w + 5). By (139), we have 


Zt op ge V2 (op) ne) ope ae 
Z a ys a °°) FZ, 5 YY? 


and by using the last member of (16) we see that 


K(w,s) := 


(163) 


Me ae 
| —-2k— = + iy) = O(log |w)). 
x 2 


From (58) we observe that 2 (2k + ; — iy) takes on small values because here the 


values of © and © are small, and the estimates (16) and (17) are applicable as can 
be seen through the middle member of (14). Hence we have 


Zi 1 
— (w+ =] = O(log|w|), (Rw =-U = -2k-1), Uo8) 
Ti 2 
and 
in 2 (w+ +) Kow,s)x%aw « TO80 (165) 
at _ 5 5)X , 
-u-iy Z a 7 " x9U? 


The estimate (164) also holds on the part St w < —1 of the horizontal sides of Z as 
can be seen again from (139). On the part —1 < iw < c of the horizontal sides, we 
use (131). Thus the horizontal sides contribute 
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=1 [n52 -Y -U 2 2 
log’ Y_, logY , log(u + Y*) _, 
« | y2 du f ye aut rs x" du 


‘ log?Y  logY _ylogY 
os 


KX 
y2 Yx Y 


(166) 


If we first let U — oo, and then let Y — oo, the integrals along the horizontal 
and left vertical sides will tend to 0. In the region enclosed by &, the poles of the 
integrand are all simple poles at the points w = —5, 0, 5 5-5, iv and at the points 
5 —z, eZ". Thus, by the residue theorem we obtain 


1 Z; 1 1 _1 1 1 
; wt K(w, s)x" dw = —K | —=,s)x 2 + K(0,s) —2K [| =, 5] x2 
201i (c) Z| 2 2 2 


zi - 
—~ 4g —5)x275 
Z| 


Cc 
: 1 7 
+ 2 K(iv,s)e” + OK (5 -28) xt, 
4 f=1 


(167) 


where / denotes that the integral is over the whole line St w = c. To simplify this 


(c) 
formula, we use the estimates 


|s(1 —3)| 1+?’ 
xl.) 3) - Cea D# Z x 
a7) | ieee ae 
20-1 1 
|K@,s)| = = 


pape 


and we majorize the last sum in (167), by using the information given in (57), as 


We can now re-write (167) as 


1 Z 1 20 — 1)x'¥ <7) 
| ! (w+ ) ons) dw = > ad ue x 1d —5) 
20i (c) Z| 2 - (o- 5)? +(t— v)? Z| 


+0 e +0 < (168) 
1 + ||? 14 |e Jo 
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For use in the evaluation of the line integral in (168), we expand the expression 


in (58) for A (z), where tz = c+3 = 1+¢€) > 1,3z > AandA is some sufficiently 


large constant, by keeping the factors £(@) intact. Assuming RH, we have 


ze beter! ¢ 9 \* 2, garg ber! (> Se) 

Wigs ok as - » 7 : 

> (2 = > (: d 2. (=) mé® 
m= x 


k=0 (z) ue k=0 m=1 
¢T 1 
O C —logl Oo | ————__ }.. 
- (ex (( eae cal’) Geigr)) + (accu) 
(169 


Next, we split the integral into several parts as 


1 Zi 1 
/ : (w+ ) Kew, s)x" dw (170) 
20 (c) Z| 2 
ligt! (oe) (k+1) 
Ce a 
= 1 . ; K 
k=0 m=1 m? 2ni (c) (4 (w So )) . 


m=1 m2 


Liogiog? J 
uM ns 1 K(w,s) x\w 
+ dX (-2)** y 4 ae (z oak yn ( dw 


Le | _ 

. loglogT x oy a 1 i K(w, s) e )' Pe 
oo” . / k 

k= m=1 ami Oa (< (w + +)) 


Ligier! 
: A®(m) 1 pet K(w,s) x\w 
k+1 y , 
7 ~?) / k+1 |) oe 


m2 Qni ei d 1 
k=0 m=1 " (Zo + )) 
iA Z 1 
“GO a “le + = + iy)K(c + iy,s)| dy 
-ia |Z) - 


re —logl log T 
+o(x f IK(c + iy.) (co( 39(€0) — log log |y|) log ) 
Iyl=A 


log log T 
mn 1 
[y| log |y| 


We will work with large |t| € [2A, T]; later on T will be a parameter which tends to 
oo. By using 


) a) =l+h+---+]6, say. 


K(c+iy,s)< = = Tt 6-8?’ (171) 


Analogues of Pair Correlation of Zeta Zeros 163 


it is easily seen that 


c xe - (ee log “) (172) 


x 
I —~_,, I 
oe 14+7 as 1+ |e| log log T 


Now note that since the zero of y’(s) is in (3, 2), if we take €9 < ! then by (16) we 


j -1 
have a + | < C4), so that 


Logie?! 0° 
g log Akt) (m) A 
h<Kxe® DY 2Cu)* > sae |k(c + iy, s)| dy (173) 
k=0 m=1 m2 —A 


log T 
Cc Liog josT 


x ¢’ yy" xe log 
K 2C41 > z K —>C loglogT 
142 2 ( ay (c+ ;)) faz 42(€o) 8 


and similarly [4 satisfies the same bound. Thus 


1 Zi 1 xe C43(€o) log T 
K(w,s)x” dw =1 L O - F 
220i I, Zi (» + 5) aaa ay ee (= exp ( log log T ) 


(174) 


The evaluation of J; and Jy splits into two cases according as m < x 
or not. Consider first the situation when m < x in which case the line of integration 
itw = c will be pulled to the left. Let UV = 2n + 1. Then, by (14), we see that 
£(-U+ 5 +it) > log U. Between the lines t w = c and ‘i w = —U the integrands 


/ 
have poles coming from the zeros of the = factor atw = ke — 5. B= V2 ccc kn, 


at w = tit by (41) and (53), and a pole arising from K(w, s) at w = 5 — 5. By the 
residue theorem we write 


ca cHY — K(w,s) a 


Qi Joi ‘i\" 
. (4 (w+ 2)) 
U-i¥ U+iY ctiY K(w,s) Pree 
= Residues + + + : (“) dw 
cai —U-i —U+i 1 
Y U-iY U+i¥ (< (w+ +)) 


with the parameters having the same specifications as in (162). The last three 
integrals all tend to 0 asn — o (i.e. -U — —oo) and Y > o as in (165)- 
(166). Now, applying Cauchy’s estimate on a disk of radius €; centred at kp — 5, for 
k > 1 we have 
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Res K(w, s) =), 
w=k—1/2 (Lov + 1/2)) 


ee, Paes ae Z w— (ke — 4) aera 
~ (k= VD! dw! 8) L(w + +) (;,) 


Cua(er)* (2) 34 
(€ + |#|)? 


< 


Summing over all £, we can include the contribution from all of these residues in 


Cas (1) Geyer 


1+ |t| \m 


Similarly we see that 


Res K(w, s) (=)" i Ca(e1)* (2)" (k>1) 
=e : k eer tony =e 
w = tit (4 +1/2)) m 1l+t 
and 
Res K(w, s) x\w x \o payht 
w= 1_g x! 1 z (=) amas aT (<) 
(E(w +)) 
(foro € [5. 7] this pole is away from the zeros of LC — 3s)). Collecting the 


foregoing estimates, we obtain (with no error term if k = 0) 
1 K(v, w if Tk 15 
5 .) way) : (=) a =-(La-9) (=)’ (175) 
Ti Sie) (x 1 m x m 
(E+) 
£6 Cas(€1)* ey 
1+ |¢| \m 


Caper). 7 x\ 41 
+0( Sr (<) \; (m<%). 


In case m > x, the calculation is done similarly, but now the line of integration is 

pulled to the right to 8 w = 2n, followed by letting the integer n — oo . The poles 
J 

encountered are at w = 5 —k,, € = 1,2,... from the 7 factor, and at w = s — 5 
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from K(w, s). The result is (with no error term if k = 0) 


a }. ( Ar 2 k (=) dw =— (Xw) - (176) 


7 (w+ 5) 
Caz (E1)* x 5k 
o( —ar (-) ). (m > x): 


Writing, for brevity, 


Liogioe7! , \-k yp \ Wk 
L(m,s):= SY > (-2)* (aero (£00) +20%m (£09) 


k=0 
(177) 


we see from (175)—(177) that 


ntn=wl (Semi —9 (2) + 20m.9(2)) 


m<x m>x 
| 20ee logT | 
kin ate ToglogT A®+D (@n) +1 (m) 
k 
+O 1 + “14+iq. 2 Cag(€1) » mates 


msx 


€1 


log T 
ae A&) (m) + A (m) 


> Cao(e1)* >> oe 


max 


irene ligt | A&+D(m) ze A® (m) 


k 
oO cea hy 2 Cso(e1)* > nee 


m>XxX 


We can put a bound on the error terms by first replacing m~*'~! with m~!~¢!x!—*! 
in the first series over m < x, and ma27 6 by x2m7!~! in the second series over 
m < x,and m7!t1+4 with m~!~¢1x*1+«1 for m > x, and then using (136) to bound 
the resulting Dirichlet series, so that we have 


ienar!(Sami-a(2)""+Lae.9(2)) 


m<x m>x 


1 
x2ta Cs1(€1) log T 
O ; 178 
T (Fi ( log log T ) ame) 
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The last step for completing the derivation of the explicit formula for Z)(s) is 


replacing the term er — 5) in (168) with 


aX 76) =x) “og Ht + O,(x2-°), (179) 


x? 6) — x2 


by using (139). 
Combining (168), (174), (177)-(179) we obtain 


Proposition 5 (Explicit Formula; Assuming RH) For s = o + it with ; <o 
see; 1 |t| € [2A, T] with A a sufficiently large constant, oe any dibianly small 


Arete =U, we have, as T > ~, 


ene al (Cains 6) aim (2) 
Q 


(o m<x m>x 
il 
i |¢| xate Cs2(€) log T 
2 Sloe — +0 
BRE Eee 2n + ( It| = ( log log T 
+0(2-°), (180) 


where &(m, s) was defined in (177). 


In (180) we take o = 5 and sum both sides over t = U € , T], where 5 + iv 


runs through the non-trivial zeros of Z,(s). In view of (55) and (1) we have 


4yiv— v) 
LoS asp 


T<t<T 


=? y 2(m. -; +18) mi-® 27 y 2(m 3 +i) mi 


msx f<t<r m>Xx Leo<r 


T log’ T 1 C. log T 
oo og” 14+0|——)\+olz2* exp Cs3(€) log T ; 
An x? log T log log T 
We do not hope to get an asymptotic estimate for x > T, so just as we did in 
connection with (126), let us assume x < 7™ and first discard the terms with m > 


T™ for some fixed M > i. Upon using (16) and trivially estimating the sum over U 
by (55), we have 


2 YD e(m seis) wh 


m>TM T<o<T 
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LiogkeT | 15 —k 

5: sae 

=2 > (-2)) mf y Aen ( (3+) ne 
k=0 m>TM! T<v<T x 


5 Ut) 
+E neeo(E(ire)) we] 


T<o<T 


log T 


Liogioe | 2 k AXED (m) 
<K xT logT » Te 
m>TM 


T 
= log = 


Lrgter oo AM vO), 
7 » 
ot a m3 


k=0 m>TM 


By (65) and (66) we see 


AC) z (log perl AW 4 z (log per 


5. ? 3 bs 
7 3M 5M 

m2 1 2 m 3 1 2 

m>T™ 


m>T™M 


so that 


5 . CsalogT 
x > > £ (n 5 + io) m2-® < 27'-T exp (Se) : (182) 


moO™ Epa log log T 


For the m < T™” terms we use Proposition 4. We first note that the contribution of 
the error term of (149) to (181) is 


Cs5 log T log log log T 
xexp ( 55 log T log log log ). (183) 


log log T 
by a calculation similar to that leading to (182). Putting these together we have 


4yilv— v) 


Ss 3S 44+ (v—d)2 ve 


U f<t<r 


_ U(O,0;x) — 2U@, 1:x) + UC, 1x) 
7 2m x2 


a = (UO, 0; x) — 2U(0, 1; x) + UC, 1;x)) 


T log? T 1 Cs¢ log T log log log T 
1+ 0{| — O ; 
= Ao x? ( ‘ (= 7) aE (x oe ( log log T )) 
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where, for 1; and 2 taking on the values 0 or | and A(k, 1; n) defined in (77)-(78), 


Lister FJ ky +ko+ty +12 
-) dt (185) 


veund= fe (meg 


lo 
ky .ko=0 8 2x on 


x D7 Alki + [1 =O]. 3m) Ae + [2 = 0], osm), 


max 


lopter Fd ky ko tty +02 
-) dt (186) 


Gad fe (meg 


lo 
ki ,k2=0 8 on tn 


> A(ky + [t1 = al id + [to = OQ], 23m) 


m 
x<m<TM 


Formulas (122)—-(123) in Proposition 3 are for the evaluation of the sums over m 
in (185)-(186), but for this application we need to extend the range of the last sum 
of (186) to all m > x. We have, using (65) and (66), 


y- A(ky + [ty = O], 413; m) A(ky = [to = O], t2; 7m) 2 y- (log m)ki theta +242 
3 = 3 


m m 
m>T™ m>T™ 
(log TMyathtatetl 
< T2M 
(187) 
When plugged in (186) this brings an error 
Nesey TJ ki thktuy+iy ligt | 
2M log T 
«T'™ IogT Y (a) «TI ™ logT Y> GMyite 
koe0\ 108 az ki,k2=0 
3logT 3 
< T!—™ (log T)(3M) Pebet << [IW 2M (188) 


which is very small for M > ‘. In applying (122)-(123), the error terms of these 
formulas give rise to error terms of the form 


ieee ky +ko+¢ 

1 2Cig 1 

«T \- (ae , (€=0,1,2), 
ky,ko=0 max(k1,k2)! \ log 7 


and recalling that x < T™ , we see that these error terms can be bounded as 
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logT 
L log log DglogT! ct +k, Lig ibe og iene?! C2 


T 2 _ max(as El ar aah Csr 2G (189) 


ki <ko 


logT 
L log log TogiogT! C2 


<T 235 at saeor= CsoT. 


ko=0 
Now, going back to (184) we have 
Ly a 
(190) 
4 
<b v<T . 4+ (uo) 

L i | atk 
7 ba P(ki + 1, ky + 1,0, 0) (log x)ki +2 +1 ia ( > a P 
_ t 

re: (ky +khot+ 1)! z log ae 

ki+h+1 
5 Pki + 1, ko, 0, I)log.xy t+? i ( 2 _ dt 
(kj + ky +2)! Tr \ log = 


ky +k +2 
P(ki, ky, 1, 1) (log x) +2 +3 [ 2 tk, . 
(ky + ko +3)! r \ tog 5 


T log? T 1 C56 log T log log log T 
1+0 O O(T). 
a Ax? ( * (= )) a (xex ( loglogT 7Ow) 


Here we plug in the values of P(k,, ky, t), 12) given in (100) and we use 


£ £ 
Fi 2 T[{ 2 € logT 
/ ~) a= = (1+0(5)). (cen,t« -e), 
Z \ log 35 2 \ log oe log T log log T 


(191) 


and upon organizing the terms we obtain 


4xilvu— v) 


ae 4 


TI 1 Lngler 2-k (21 ial loglog T 

0) 0) “KI (0) oglo 

ee Gee - +> —— (140 (AEE) 
4a log = par (2k + 2)! log 5— log T 


1 C56 log T log log log T 
O{ —  ]]+0O|[xexp ee ee + O(T). 
log T log log T 


T log? T 
An x? 
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We would like to extend the sum over k to an infinite sum. Observe that for x = T 
we no longer have an asymptotic estimate, so we take 1 < x < T'-©. Then 0 < 


205% < 2, and with K = [227] we have 
log 37 log log T 


s kl 2k+2 3 1 i k+1 2 ie 
ain <E(ch)"*s2 (eh) 
& (2k + 2)! HN\k+1 =k ra! 
ea eae oe cle (eae ot 
~K\K+1 K+1 \K+1 


1 4 K+1 1 
-7(z) == 


K+1 


log T 
iT 1 T 4] 1 T \ loglogT 
og log ( og log ) eipeltx 


(193) 
log T log T 
Hence (192) can be re-written as 
4xlv— v) 
Saw as 


UV fe td 
Tl l = dekt [20 ire log log T 
_ Flogx|, _ , losx > = (1+0(% og )) 
4a log £ an 4 (2k + 2)! log an log T 
T log? T 
4m x2 


1 l-e 
(ser) ) +O. (d<x<T'‘). 


Now we do the same process as after (155), and obtain 


Theorem 2 Assume RH. Let 5 + iv and 5 + iv run through the critical zeros of 
Z\(s). We have, as T — ov, uniformly for 1 < x < T'~, 


4xi(v—v) 
Fz,,2,(x,T) = t Feuroe (195) 


O<v,0<T 
Tl re. ac ey a 21 oe log log T 
ogx og x "Kt ogx og lO 
_ Tlog g > g: (1+ 0/ g log )) 
Qn "pe 2 x 4, (2k + 2)! \ log x log T 


jo OEE (pga | + O(T) 
20x? logT ‘ 


The formula (3) also holds for Fz,.z, (x, T). 


Remark The asymptotic value in (195) is the same as the value found by Farmer 
et al. [5] for the pair correlation of the zeros of the derivative of the Riemann 
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€-function. In [5] an explanation of why the leading order terms of the pair 
correlation functions for &’ and Z’ should be the same was given. Since the leading 
order of the correlations are the same, the same conclusions can be drawn from 
Theorem 3 as Farmer, Gonek and Lee’s results. Thus we can say that, assuming RH, 
a positive proportion of the gaps between consecutive zeros of Z;(s) are smaller than 


2 
0.89661 times the average spacing (~ ar). and more than 85.838% of the zeros 
og 


of Z)(s) are simple. For the proofs of these results and references to earlier related 


results we refer the reader to [5]. 
We now define, suppressing the dependence on T in the notation, 


r= (Four) [ea((Z) 7) +200 (ZY) +00 (ZY )] 
(196) 
so that 


= T -1 4 T ia(t;—t2) 
F(a) = (F087) > awe (=) . (197) 


0<t),o<T 
¢-Z1(4 +it/)=0, (7=1,2) 


T 
Note that we have taken x = — because of the powers in the infinite series in (195). 


sa 
This won’t make any difference in the asymptotics for F;¢ and F;z,. Then we know 
from (4) and Theorems | and 2 that, for 0 < |a| < 1 — with any arbitrarily small 
but fixed e€, 
1 (m—))! 
F(a) ~ 2\a| — 4a)? + = 
(a) ~ 2la| —4lal? +5) 


m=1 


(la |! 4 oT fog, (198) 


as T > ow. 

In order to show the existence of small gaps between zeros of ¢ - Z; along with 
a value for the size of such gaps, we follow the method developed by Montgomery 
[17] and later on also used in [5]. So we will take h(u) to be a minorant of the 
characteristic function of the interval [—1, 1] such that the domain of its Fourier 
transform h(a) includes [—1, 1] and is non-negative everywhere. Furthermore, to 
minimize losses we will require = 1. Montgomery [17] used the pair h(u) = 
max(1 —|u|,0), (or) = (222)"; in [5], h(w) = (2824)? + 
minorant of the characteristic function of the interval [-1, 1] sith Fourier transform 
h(a) = = max(1 — |a| + sindalal ,0) was taken (the use of the latter ae pair in 
such a context goes back to 7 ]). Let m,, denote the multiplicity of a zero 5 +t of 
€-Z,(s), and A € (0, 1). Then we see that 


>> im, +2 = l= 2 el (v- nycer). 


0<t) <T 0<t ,t2<T O0<t t22<T 


ma 
0<ty “hi eT 


(199) 
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and since 


2u _ ear, 2m ie 24 _ A mar AB 2nipu 
(3) = [tere Sda= Sf i(F)e dp, 


we have 


Sse 12 (Fer) > f rp ( ) ap 


0<t) <T 0<t.o<T 
T x l-—e 
z (Zier) 5 fa en( >) ap. 
mT 2 J-ite 


0<ti-h< 
In the last step we have used the non-negativity of h and F, the latter seen in the 
same way as (3). We may assume that 


T 
> My ~ BE 


0<t<T 


mh 
logT 


for otherwise there would be infinitely many multiple zeros of ¢ - Z, and there 
wouldn’t be any need to prove a result for existence of small gaps between zeros. 


Then 
1 (L-) Emer) (103 [5 ai()) 


0<t) .to<T 
0<t\—-h< 


ma 
logT 


(200) 


Since this holds for any fixed € > 0, we want to find an as small as possible value 
of A for which 


+3 fs rpn( ?) ap >o. (201) 


1 


Choosing the Fourier pair h, has in [5], by (198) the condition (201) is cast into 


af pe- oe eS . Happrtaar oer (128 4 S28) a 


m=1 


> 1. (202) 


Some computing reveals that (202) is satisfied when we take A = 0.78842, and we 
can state 


Analogues of Pair Correlation of Zeta Zeros 173 


Corollary 2 Assume RH. There exist infinitely many gaps of size < 0.78842 of the 


average gap (~ ) in the sequence composed of the zeros and maxima points 


1 
log T 
on the critical line of &(s). 

Remark Since the critical zeros of ¢ and Z, are interlaced these small gaps are 
between a zero of ¢ and a zero of Z;. The result of Corollary 2 doesn’t seem not 
very strong, but it is better than the trivial bound 1, and it is the first result of its 
kind. Applying (198) for the proportion of simple zeros of ¢ - Z; doesn’t give a 
bright result. By some elementary considerations and Hall’s result given in (55) 
we may take that a non-simple zero ; + it; of €- Z, has multiplicity at least 3. 


Then, Montgomery’s argument for the proportion of simple zeros, together with 
3 
y 1> y — and a numerical fact from the proof of Corollary 1.3 of 


0<t)<T 0<t<T 


t1: simple 
T 
[5], gives the rather dismal estimate > 1 = 0.38 (< log r). That these results 
0<t,<T Gs 
t: simple 


aren’t as strong as those obtained from F;¢(x, 7) by Montgomery is related to the 
fact that F;z, (x, T) changes sign at x ~ fT. 


9 Correlation of the Zeros of Two Dirichlet L-Functions 


In this section, we apply the ideas of Sects.3 and 4 to correlating the zeros of 
two Dirichlet L-functions associated with primitive Dirichlet characters. Let y be 
a primitive character to the moduli g, (so that g, = 3). In this section the Dirichlet 
characters will always be taken to be primitive characters to moduli which are fixed 
(i.e. not a function of 7). The non-trivial zeros of L(s, 7) have real parts in the strip 
o € (0, 1), and the number of these with ordinates in [0, 7] is 


T qyT 
N,(T) = a log + 


oe! + O(logq,T), (T= 4) (203) 
2n 20 

(see, e.g., Corollary 14.7 of [19]). We shall assume the Generalized Riemann 
Hypothesis (GRH) for Dirichlet L-functions that all of the non-trivial zeros of 
L(s, x) are on the critical line. Let py = 5 + iy, run through these zeros. Quoting 
from [23], the analogue of Montgomery’s explicit formula (19) is 


> (Qo - aa . _ = (Zam Ce é Ye A(n) x(n) (y"") 
Xx 


1\2 2 
Vx (o — 3) oie n<x n>x mt 


L : 
— zi —o + it, yyx2 ott 
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= (20 — 1)x72 8% 


¥ py ea er er Cee ETE 


r=1 


(204) 
valid under GRH for o > 1, and all x > 1. Here 


0 if x(-1) = 1, 
= : 
1 if x(-1) = -1. 


We employ (204) witha = 3, then while going through a procedure paralleling that 


of Sect. 3 we substitute 


1 
It| 


L’ t L! 
Fach a tog (22) rad im + 0( ).C2<o<3 men 


(205) 


which follows from logarithmic differentiation of the functional equation of L(s, 7), 
and we obtain the analogue of (22) 


4yi(vx—-9 5 a or 
= - x A(n)x(n)n2—" — x A(n)x(n)n—2~" 
2 4+ (y,— 1)? dX 2 
+ x *(log dy\t| + O(1)) (206) 


for |t| > 1 andx > 1. 

Now let y be a primitive character to the modulus qg,, and let ¢ run through the 
ordinates yy € (1,7] of the zeros of L(s, y). Summing (206) over this set of t 
values, we have 


¥ > Axi(vx-Vw) 2 T log? T (: he ( 1 )) s 
Vy 1<yyST 4+ (yy - Yu)? Qn x? WM log T 
=x? SAM) y@n? SO nv 
nsx 1<yy<T 
225 = A(n)x(n)n* ~ nov, 
n>x l<yy<T 


where we have used 


ae 1 
> (log qyvy + O()) = 5, 08 T (1 Op a (<>) (208) 


l<yyST 


which follows from (203) by Stieltjes integration. 
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At this point we need the analogue of the Landau—Gonek formula (25) for L(s, w) 
to be used in (207). A meticulous GRH dependent version was given by Fujii [6]. 


Proposition 6 For y, T > 1 we have 


> yr = -5-V0)A0) + O(y log(ygyT) log log 3y) + O (tog min (7. ~)) 
1<yy<T 


1 log? 
+ 0(log(qyT) min( 7, —))+o0(~ 2%). (209) 
log y log 2y 


The proof of this proposition is similar to Gonek’s proof of (25) in [13], so we 
omit it. The only slight difference is in estimating the contribution of the segment 
along t = | (in case there is no yy = 1; if there is a yy = 1, then this segment 


can be shifted a little bit), we use Lo +i) < log? dy for —1 <o < 2, giving rise 
to the last error term in (209). If GRH is assumed, as we have already done, and if 
y =n >= 2, then the argument gives 


= T log? dy 
Son PY = ——— p(n) A(n) + O(log n log logn) + O(log qyT) + O 
ley <P 2mn logn 


(210) 
Plugging (210) into (207) we have 


» Axix-Y) 7 T log? T (: a ( \ )) a 
qd 7..7 
Vx l<yy<T 4+ (Yy - vy)” 20x? xe log T 


Ayn) 


3 


T = Tx? 
ta VON += 


n 
nsx n>x 


x log? dw 
log2x ]° 


+ O(x log 2x log log 3x) + O(xlog gyT) + O ( 


The summation conditions for the double sum on the left-hand side can be changed 
into 0 < yy, ¥y < T within an error of O(log(q¢,T) log(qy,T) log T) as can be seen 
by a standard estimation. 

If y ~ w, then from 


S> An) (y(n) K x2 log?x,  (qyqv <9), (212) 


n<x 


on GRH [2, Chap. 20], we have 


2 2in\w 
a Yaad p(n) + os Xe “ eRe « Tx"? log? 2x, (213) 


so that there is no main term of size T log T for x > T*. 


176 Y. Karabulut and C.Y. Yildirim 


If y = y, then with gy = qy < x, the terms in (213) are the same as in (26) for 
the pair correlation of zeros of ¢(s) except for the summands with (n,g,) > 1. The 
effect of these latter terms are 


T 4 T log x log qy 


(214) 


n<x 
(n,qy)>1 


This completes our calculation in this section, and we state our findings as 


Theorem 3 Assume GRH. Let 5 + iyy and 5 + iyy run through the critical zeros 
of L(s, x) and L(s, ), respectively. Assume that the moduli q,, qy are fixed. Then, 
as T > ~, 


4yiVy—-Vw) 


O0<yy<T 0<yy<T 


Spas (= ie o ( EPetr)) 


20 x 


& T log’ T 140 1 
nx? mee * oe Tt 
+ O(Tx~? log? 2x) + O(x log 2x log log 3x) + O(xlog 7). 


Remark We note that if y = 1 is taken then the preceding calculation is the same, 
and so we obtain the correlation of critical zeros of ¢(s) with those of L(s, x). 


Remark If x is a non-real character and y = X, then the zeros of L(s, w) occurring 
in F',y(x, T) are the reflections with respect to the real axis of the zeros of L(s, x) 
which are below the real axis, and for x > T* the main term is again 0. Thus the 
zeros of L(s, y) which are above the real axis can be said to be not correlated to 
those which are below the real axis. 


Remark In [23] a certain average over correlations of zeros of two Dirichlet L- 
functions of the same modulus were obtained by following Montgomery’s method. 
Our theorem here is more particular. 


Remark Gallagher’s citation of oral communication in [7] seems to indicate that a 
result of the same kind as Theorem 3 was observed by A. Fujii. 

We now work on the product L(s, 7) - L(s, Ww), where y # w are primitive 
characters. Similar to (196), let 


T —1 
§(a) := (- log r) [Fyy(I%,T) + 2Fyy (1,7) + Fy.y(T,T)], (216) 
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so that 


ii > 4 ; 
= {|—logT pie (tt) 
5 (- = ) 2 4+ (t — tf)? 


0<t .to<T 
L(5 tit) ~-L((5 +itj.W)=0, G=1.2) 


(217) 


Then we know from Theorem 3 that, for 0 < |a| < 1 — € with any arbitrarily small 
but fixed e, 


F(a) ~ ja| + 277-7" logT, as T > oo. (218) 


For small gaps between zeros of L(s, y) - L(s, w), proceeding as in (199)-(202), we 
reach the condition 


1 . 
af E + 27? log r| ( en ee) dp > 1. (219) 
0 2 20 


Some computing reveals that (219) is satisfied with A = 0.68742, and we have 


Corollary 3 Assume RH. There exist infinitely many gaps of size < 0.68742 of the 
ca 


average gap (~ ) in the sequence composed of the zeros of L(s, x) + L(s, W), 


log T 
where y # W are primitive characters. 
Note that the small gaps detected here may be coming from a single L-function. 


10 Further Problems to Study 


The method presented in this article may be applied to correlating the ordinates of 
zeros of the Riemann zeta-function or Dirichlet L-functions to various sequences of 
real numbers (and possibly generalizable to other functions from the Selberg class). 
Moreover as in Sect. 8 one may try to work out the pair correlation of zeros of other 
related functions such as Z; (s). Furthermore, our method is applicable to products of 
not just two Dirichlet L-functions, but in fact to products of k Dirichlet L-functions. 
In any such problem it is desirable to obtain results which have implications for 
the distribution of zeros on the critical line. Another problem of interest is to form 
conjectures about the behaviour of the correlations of Theorems | and 2 for x > T 
as Montgomery [17] did for the pair correlation of zeta zeros. Montgomery drew 
upon the Hardy—Littlewood conjecture about sums of the kind > A(n)A(n + h) 
nsx 
for formulating the conjecture given in (8). The conjecture that all zeta zeros are 
simple also provided guidance in the sense that his conjecture should agree with 
it. In the case of the correlation sums of Theorems | and 2, extension to x > T 
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will involve formulating Hardy—Littlewood type of conjectures for the arithmetic 
functions A*(n) and A*(n), or else one may rely upon the random matrix model to 
first formulate the analogues of the pair correlation conjecture (10) and then pass to 
the analogues of (8) therefrom. 


Acknowledgements C.Y. Yildirim thanks Professors Enrico Bombieri and Andrew Granville for 
helpful conversations and suggestions. 
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Bagchi’s Theorem for Families of Automorphic 
Forms 


E. Kowalski 


Abstract We prove a version of Bagchi’s Theorem and of Voronin’s Universality 
Theorem for the family of primitive cusp forms of weight 2 and prime level, and 
discuss under which conditions the argument will apply to a general reasonable 
family of automorphic L-functions. 


1 Introduction 


The first “universality theorem” for Dirichlet series is Voronin’s Theorem [18] for 
the Riemann zeta function, which states that for any r < 1/4, any continuous 
function g defined and non-vanishing on the disc |s — 3/4] < rin C, which is 
holomorphic in the interior, and any ¢ > 0, there exists tf € R such that 


max |C(s + it) —g(s)| <e. 
|s—3/4|<r 


In other words, up to arbitrary precision, any function g can be approximated by 
some vertical translate of the Riemann zeta function. 

Bagchi, in his thesis [1], provided a clear conceptual explanation of this result, 
as the combination of two independent statements: 


¢ Viewing translates of the Riemann zeta function by t € [—7,T] as random 
variables with values in a space of holomorphic function on the disc, Bagchi 
proves that these random variables converge in law, as T — +00, to a natural 
random Dirichlet series, which is also expressed as a random Euler product; 

¢ Computing the support of the limiting random Dirichlet series, and checking that 
it contains the space of nowhere vanishing holomorphic functions on the disc, 
the universality theorem follows easily. 


The key step, from our point of view, is the first part, which we call Bagchi’s 
Theorem. Indeed, once the convergence in law is known, it follows that there is 
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“some” universality statement, with respect to the functions in the support of the 
limiting random Dirichlet series. The second step makes this support explicit. (This 
might be compared with Deligne’s Equidistribution Theorem, as applied to families 
of exponential sums, for instance: Deligne’s Theorem shows that there is always 
some equidistribution of these sums.) 

The goal of this note is to give a first example of a genuinely higher-degree 
statement of this type, and to deduce the corresponding universality statement. We 
will also indicate a general principle that should apply in many more cases. 


Theorem 1 (Universality in Level Aspect) For q prime > 17, let S2(q)* be the 
non-empty’ finite set of primitive cusp forms for T(q) with weight 2 and trivial 
nebentypus. For f € S2(q)*, let L(f, s) denote its Hecke L-function 


Lif,s) = Do Aen, 


n21 


normalized so that the critical line is Re(s) = 5. 

For any real number r < i let D be the open disc centered at 3/4 with radius 
r. Then for any continuous function g@ : D—>C which is holomorphic and non- 
vanishing in D and satisfies 


y(o) > Oforao Ee DNR, (1) 


we have 
liminf ———_[{f € S)(q)* | IL(f,-) — Glleo < £}| > 0 
1m int ——— 7 € 
q>+00 |52(q)*| ae ae 


for any ¢ > 0, where the L® norm is the norm on D. 

The main difference with previous results involving cusp forms (the first one 
being due to Laurincikas and Matsumoto [13]) is that we do not fix one such L- 
function L(f,s) and consider shifts (or twists) L(f,s + it) or L(f x x, 5), but rather 
we average over the discrete family of primitive forms in S2(q)*. It is also important 
to remark that the condition (1) is necessary for a function on D to be approximated 
by L-functions L(f, 5) with f € S2(q)*. (We will give more general statements where 
the discs D are replaced with more general compact sets in the strip 5 <o <1). 

We will prove this Theorem in Sect. 2, after stating the results generalizing the 
two steps of Bagchi’s strategy for the zeta function. The proof of Bagchi’s Theorem 
for this family is an analogue of a proof for the Riemann zeta function that is simpler 
than Bagchi’s proof (it avoids both the use of the ergodic theorem and any tightness 
or weak-compactness argument). 


'We assume q > 17 to ensure this property; it also holds for g = 11. 
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In Sect. 5, we discuss very briefly how this strategy can in principle be applied to 
very general families of L-functions, as defined in [8]. 
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Notation 


As usual, |X| denotes the cardinality of a set. By f « g for x € X, orf = O(g) for 
x € X, where X is an arbitrary set on which f is defined, we mean synonymously that 
there exists a constant C > 0 such that |f(x)| < Cg(x) for all x € X. The “implied 
constant” is any admissible value of C. It may depend on the set X which is always 
specified or clear in context. We write f = gif f < g and g </ are both true. 

We use standard probabilistic terminology: a probability space (92, 4’, P) is a 
triple made of a set 2 with a o-algebra and a measure P on Y with P(Q) = 1. 
We denote by E(X) the expectation on §2. The law of a random variable X is the 
measure v on the target space of X defined by v(A) = P(X € A). IfA C Q, then 1, 
is the characteristic function of A. 


2 Equidistribution and Universality for Modular Forms in 
the Level Aspect 


We will prove Theorem | by combining the results of the following two steps, each 
of which will be proved in a forthcoming section. Throughout, we assume that q is 
a prime number = 17. 

Step 1. (Equidistribution; Bagchi’s Theorem) For g prime, we view the finite 
set S2(q)* as a probability space with the probability measure proportional to the 
“harmonic” measure where f € S2(q)* has weight 


l 
ff) 
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in terms of the Petersson inner product. We write correspondingly E,(-) or P,(-) for 
the corresponding expectation and probability. Hence there exists a constant cy > 0 
such that 

Cq 


ff) 


E,(g(f))= >> v(f) 


fES2(q)* 


for any y: S2(q)* — C. From the Petersson formula, it is known that c, > 1/(477) 
as q — +00 (see, e.g., Iwaniec and Kowalski [7, Chap. 14] or Cogdell and 
Michel [3]). 

Let D be a relatively compact open set in C such that D is invariant under complex 
conjugation. Define #{(D) to be the Banach space of functions holomorphic on D 
and continuous and bounded on D, with the norm 


IIPlloo = sup |9(s)]. 
sED 


This is a separable complex Banach space. Define also 7p(D) to be the set of p € 
H(D) such that f(s) = f(s) for all s € D (this is well-defined since C is assumed to 
be invariant under conjugation). Note that the L-function of f, restricted to D, is an 
element of 7r(D) since the Hecke eigenvalues A,/(n) are real for all n > 1. 

We define L, to be the random variable S2(q)* — H(D) mapping f € S2(q)* to 
the restriction of L(f,s) to D. (This depends on D, but the choice of D will always 
be clear in the context.) 

If D is a compact subset of the strip 5 < Re(s) < 1, then we will show that 
L, converges in law to a random Dirichlet series. To define the limit, let (X,), be a 
sequence of independent random variables indexed by primes, taking values in the 
matrix group SU2(C) and distributed according to the probability Haar measure on 
SU2(C). 


Theorem 2 (Bagchi’s Theorem for Modular Forms) Assume that Disa compact 
subset of the strip f < Re(s) < 1. Then, as q— + 00, the random variables L, 
converge in law to the random Euler product 


Lyp(s) = I] det(1 —X,p“)"! = [a — Tr(X,)p7$ + p21)! 
7 P 


which is almost surely convergent in H(D), and belongs almost surely to Hg(D). 
Step 2. (Support of the random Euler product) 
To deduce Theorem | from Theorem 2, we need the following computation of 
the support of the limiting measure. 


Theorem 3 Suppose that D is a disc with positive radius and diameter a segment 
of the real axis, always with D contained in 5 < Re(s) < 1. The support of the law 
of the random Euler product Lp contains the set of functions ¢ € H(D) such that 
g(x) > Oforxe DOR. 
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Note that since DNR is an interval of positive length in R, the condition g(x) > 0 
for all x € DMR implies by analytic continuation that g € Hp(D), which by 
Bagchi’s Theorem 2 is a necessary condition to be in the support of Lp. 

Step 3. (Conclusion) The elementary Lemma | below, combined with Theo- 
rems 2 and 3, implies Theorem | in the form 


wo ath Cq 

lim inf Py(IIL(F.) — @lloo < ©) = lim inf 2, pA? (2) 
fES2(q)* , 
IL¢.)—¢lloo<e 


for any function g as in Theorem | and any ¢ > 0. We can easily deduce the 
“natural density” version from this: let A be the set of those f € S2(q)* such that 
IIL(f.-) — loo < &; then for any parameter 7 > 0, the definition of the harmonic 
measure on S2(q)* gives 


: (A) (ff) 
in@iy 1= EMO) = (P(A) = Pl Sor e n)). 


There exists 6 > 0 such that the first term is > 6 > 0 for all g large enough by (2); on 
the other hand, a result of Cogdell and Michel [3, Corollary 1.16] and the classical 
relation between the Petersson norm and the symmetric square L-function at s = 1 
(see, e.g., [7, (5.101)]) imply that we can find 7 > 0 such that 


(rf) ; 
iin, Pl s@r <1) <>. 


For this value of 7, we obtain 


1 3 
lim int ——— )-1> 50. 
q>too |S2(q)*| = 2 


More precisely, the result of Cogdell—Michel is that for any 7 > 0, we have 
lim P,(L(Sym’f, 1) < n) = F(logn) 
q> +00 


where F is the limiting distribution function for the special value at | of the 
symmetric square L-function of f € S(q)*. Since F(x) — 0 when x > —oo, 
we obtain the result. 


Remark I It would also be possible to argue throughout with the uniform prob- 
ability measure on S2(qg)*; the only change would be a slightly different form of 
Theorem 2, where the random variables (X,) would not be identically distributed 
(compare with the equidistribution theorems of Serre [16] and Conrey et al. [4]). 
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Lemma 1 Let M be a separable complete metric space and (X,) a sequence of 
random variables with values in M that converge in law to X. Let S be the support 
of the law of X. Then for any x € S, and any open neighborhood U of x, we have 


lim inf P(X, € U) > 0. 
n—>+oo 


Proof By classical criteria for convergence in law, we have 


lim inf P(X, € U) > P(X € U) (3) 
n—>+o0o0 


for any open set U C X (see, e.g., [2, Theorem 2.1 (iv)]). Since x € S, we have 
P(X € U) > 0, hence the result. Oo 


3 Proof of Theorem 2 


We begin with some preliminaries concerning the random Euler product Lp. In fact, 
if will be convenient to view it as a holomorphic function on larger domains then 
D, in a way that will be clear below. For this purpose, we fix a real number o9 such 
that 5 < 00 < 1, and such that the compact set D is contained in the half-plane S 
defined by Re(s) > oo. 

We recall that for v > 0, the d-th Chebyshev polynomial is defined by 


U,,(2 cos(x)) = — 


The importance of these polynomials for us lies in their relation with the represen- 
tation theory of SU2(C), namely 


U,(2.cos(x)) = Tr (som (“ “) 
0 e Ix 


for any x € R, where Sym® is the d-th symmetric power of the standard two- 
dimensional representation of SU2(C). 
We define a sequence of random variables (Y,,),>1 by 


Y, = I] U,(Tr(X,)). 
p"||n 


In particular, we have Y,Y,, = Ym ifn and m are coprime, and Y, = Tr(X,) if p is 
prime. The sequence (Y,,) is independent and Sato—Tate distributed. Moreover, since 
|U,(0)| < v + 1 for all v > 0 and all t € R, we have 


Yl < []@ +) =d@) «n’ 


p®||n 


for n = 1 and e > 0, where the implied constant depends only on e. 
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Lemma 2 (1) Almost surely, the random Euler product 


| [deta — x,p™) 7 
Dp 


converges and defines a holomorphic function on S. In particular, Lp converges 
almost surely to define an H(D)-valued random variable. 
(2) Almost surely, we have 


[ [deta —x,p)! =D Yan 
P n21 


for all s € S, and in particular Lp coincides with the random Dirichlet series 
on the right. 
(3) Foro > 1/2 and u = 2, we have 


H(i 


nu 


2 
\«1 


where the implied constant depends only ono. 


Proof (1) Let o be a fixed real number such that ; < 0 < 6p. By expanding, we 
can write 


—log det(1 — X,p~*) = Y,p* + p(s) 
where the random series 


AO) 
Pp 


converges absolutely (and surely) for Re(s) > 1/2. Since E(¥,p'°) = 0 
and E(Yp *?) = p *°, Kolmogorov’s Three Series Theorem (see, e.g., [14, 
Theorem 0.JII.2]) implies that the random series 


tp? 
p 


converges almost surely. By well-known results on Dirichlet series, this means that 
the random series 


dtp 
Dp 
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converges almost surely to a holomorphic function on the half-plane S. This implies 
the first statement by taking the exponential. The second follows by restricting to D 
since D is contained in S. 

(2) We first show that, almost surely, the random Dirichlet series 


L(s) = = Y,n° 


n21 


converges and defines a function holomorphic on S. The key point is that the 
variables Y,, for n squarefree form an orthonormal system: we have 


E(¥,:Yn) = 6(n, m) 


if n and m are squarefree numbers. Indeed, if n 4 m, there is a prime p dividing 
only one of n and m, say p | n, and then by independence we get E(¥nYn) = 
E(Y,)E(n/p Ym) = 0; and if n = m is squarefree, then we have 


E(¥,) = | [E() =1. 


pina 


Fix again o such that 5 < 0 < 09. By the Rademacher—Menchov Theorem (see, 
e.g., [9, Theorem B.8.4]), the random series 


b = 7 
) Y,n 
n 


over squarefree numbers converges almost surely. By elementary factorization and 
properties of products of Dirichlet series (the product of an absolutely convergent 
Dirichlet series and a convergent one is convergent, see, e.g., [6, Theorem 54]) the 
same holds for 


s Y,n °. 


n21 


As in (1), this gives the almost sure convergence of the series defining L(s) to a 
holomorphic function in S. Restricting gives the H(D)-valued random variable Lp. 

Finally, almost surely both the random Euler product and the random Dirichlet 
series converge and are holomorphic for Re(s) > oo. For Re(s) > 3/2, they con- 
verge absolutely, and coincide by a well-known formal Euler product computation: 
for any prime p and any x € R, denoting 


= (5 x) 


Bagchi’s Theorem for Families of Automorphic Forms 189 


we have 


det(1—#(@x)p*)' = —e*p*) (1 -e 8p)! 
= by Tr Sym’ (t(x))p* = > U,(x)p 
v20 v20 


(compare the discussion of Cogdell and Michel in [3, Sect.2]). By analytic 
continuation, we deduce that Lp = Lp almost surely as H(D)-valued random 
variables. 

(3) Since the random variables Y,, are real-valued, we have 


B(D n= ) _ Py Tone Eat. 


For given n and m, let d = (n,m) andn’ = n/d,m' = m/d. Then by multiplicativity 
and independence of the variables (Y,),, we have 


E(Y,, Yn) = E(¥j)E(%v )E(n’). 
By the definition of Y,, we have E(Y,’) = 0 ifn’ is divisible by a prime p with odd 


exponent, and similarly for E(Yin’). Hence we have E(Y,,Y,,) = 0 unless both n’ and 
m’ are squares. Therefore 


B(|) Y,n ° 


nu 


2 E(Y7) 1 
< EQ nYn) < 
) 2 2° 2 (mn)2° ( ) +00 


since 0 > 1/2 and E(Y,) « n® for any ¢ > 0. oO 
The key arithmetic properties of the family S2(q)* of modular forms that are 
required in the proof of Theorem 2 are the following: 


Proposition 1 (Local Spectral Equidistribution) As g¢ — +00, the sequence 
(A¢(P))p of Fourier coefficients of f € S2(q)* converges in law to the sequence 
(Yp)p: 


Proof This is a well-known consequence of the Petersson formula, see, e.g., [11, 
Proposition 8], [12, Appendix] or [3, Proposition 1.9]; here restricting to prime level 
q and weight 2 also simplifies matters since this ensures that the old space of S2(q) 
is zero. oO 


Proposition 2 (First Moment Estimate) There exists an absolute constant A = 1 
such that for any real number 6 > 0 with 6 < 1/2, and for any s € C such that 
5 +6 < Re(s), we have 


E,(IL(,5)|) K (+ |s/)*, 


where the implied constant depends only on 6. 
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Proof This follows easily, using the Cauchy—Schwarz inequality, from the second- 


moment estimate [10, Proposition5] of Kowalski and Michel (with A = 0); 
although this statement is not formally the same, it is in fact a more difficult average 
(it operates closer to the critical line). oO 


We now prove some additional lemmas. 


Lemma 3 (Polynomial Growth) For any real number o > oo, we have 
B(\E 
n21 


uniformly for all s such that Re(s) = 0 > oo. 


) «1+ 


Proof We write 


L(s) = > Y,n *. 


n21 
This is almost surely a function holomorphic on the half-plane S. The series 


Y, 


ne 
n21 


converges almost surely. Therefore the partial sums 


Y, 
Sy = — 


o0 
nu 


are bounded almost surely. By summation by parts, it follows from the convergence 
of the series L(s) that for any s with real part Re(s) => o > oo, we have 


+oo 5 
L(s) = (s- 0) / —7 


where the integral converges almost surely. Hence almost surely 


Sul 


yo-oitl 


u. 


+00 
IL(s)| < (1 + Ist) / 


Fubini’s Theorem and the Cauchy—Schwarz inequality then imply 


+00 d 
BILE) <0+b) [BOSD sSors 


<K 1+ |s| 


yoo! 


+00 
< (1+ |s) i E((S,[2)'2 
1 


by Lemma 2 (3). | 
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We now consider some elementary approximation statements of the L-functions 
and of the random Dirichlet series by smoothed partial sums. For this, we fix once 
and for all a smooth function g: [0,-+co[— [0, 1] with compact support such that 
(0) = 1, and we denote ¢ its Mellin transform. 

We also fix T > 1 and a compact interval J in ]1/2, 1[ such that the compact 
rectangle R = I x [—T,T] C C is contained in S and contains D in its interior. We 
then finally define 6 > 0 so that 


min{Re(s) | se R}= ; + 26. 


Lemma 4 For N = 1, define the H(D)-valued random variable 
Ly = %n(=)a 
‘D n N : 
n21 
We then have 


E(([Lp — Lp lloo) < N~* 
for N = 1, where the implied constant depends on D. 
Proof We again write 


L(s) = = Y,n* 


n21 


when we wish to view the Dirichlet series as defined and holomorphic (almost 
surely) on S. 
For any s in the rectangle R, we have almost surely the representation 


1 
L(s) — L (s) = -—— / L(s + w)6(w)N" dw (4) 
2in (—8) 
by standard contour integration.” 


We also have almost surely for any v in D the Cauchy formula 


1 


d 
Lp(v) = 19 (0) = = / 69) - LO) 


?Here and below, it is important that the “almost surely” property holds for all s, which is the case 
because we work with random holomorphic functions, and not with particular evaluations of these 
random functions at specific points s. 
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where the boundary of R is oriented counterclockwise. The definition of the 
rectangle R ensures that |s — v|~! >> 1 for v € Dands € OR, and therefore 


Io ~ 25 loo < f We(s) ~ 2%) ds. 
Using (4) and writing w = —6 + iu with u € R, we obtain 
Lp — LY) loo K NE [fe |L(—6 + iu + s)| |6(—6 + iw)||ds|du. 
Therefore, taking the expectation, and using Fubini’s Theorem, we get 


E(|Lp a is) < no | / E(|L(—4 + iu + s)|) |@(—6 + iu)||ds|du 
ar JR 


<N~* sup if E(|L(—é + iu + o + it)|) |@(—6 + iu)|du. 
R 


s=o+itEeR 


We therefore need to bound 
/ E(|L(—é + iu + o + it)|) |G(—6 + iu)|du. 
R 


for some fixed o + if in the compact rectangle R. The real part of the argument 
—§+iut+tort+itiso—d> 5 + 6 by definition of 5, and hence 


E(|L(—6 + iu+o + it)|)<1+|]-é+imu+o-+it| «14 {ul 


uniformly for o + it € R and u € R by Lemma 3. Since @ decays faster than any 
polynomial at infinity, we conclude that 


/ E(|L(—6 + iu +o + if)|) |G(—8 + iu)|du « 1 
R 


uniformly for s = o + it € R, and the result follows. Oo 
We proceed similarly for the L-functions. 


Lemma 5 For N > | andf € S2(q)*, define 


LO¢,s) =~ Aeine( a 


n21 


and define L” to be the H(D)-valued random variable mapping f to L“(f,s) 
restricted to D. We then have 


E,(IIb_ — Lo Ilo) KN? 


for N = \andaill q. 
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Proof For any s € R, we have the representation 


1 “A Ww 
L(f,s) —-L™ (f,s) = re , L(f,s + w)@(w)N"dw. (5) 
(— 


and for any v with Re(v) > 1/2, Cauchy’s theorem gives 


1 
2in 


L(f.v) —L (fv) = / ff.) - LE. ye, 


where the boundary of R is oriented counterclockwise. As in the previous argument, 
we deduce that 


Ly — L™ Ileo < / IL(f.s) — Lf. s)l|ds| 
oR 


for f € S2(q)*. Taking the expectation with respect to f and changing the order of 
summation and integration leads to 


E4(Iby— Lf loo) < ff Ba(lL¢f.5) — LE. 9) 


< sup E,(|L(f,s) — Lf, s)]). (6) 


sEoR 


Applying (5) and using again Fubini’s Theorem, we obtain 
E,(|L(f,s) — Lf, s)|) « N° : |o(—6 + iu)|E,(\L(f, —6 + iu + 0 + if)|)du 
for s € OR. Since o — 5 = 5 + 6, we get 
E,(|L¢,—5 + iut+o + it)|) < (+ |ul)4 (7) 
by Proposition 2, where A is an absolute constant. Hence 
E,(\L&,s) -L™,s)|) KN? I |\G(—6 + iu)|(1 + |ul)4du « N~. (8) 
We conclude from (6) that 
E,(IIby— Lo lloo) « N, 


as claimed. oO 
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Proof (Proof of Theorem 2) A simple consequence of the definition of convergence 
in law shows that it is enough to prove that for any bounded and Lipschitz function 


f : H(D)—C, we have 
E,(f(Lq))—E((Lp)) 
as q— + oo (see [2, p. 16, (ii)= (iii) and (1.1), p. 8]). To prove this, we use the 


Dirichlet series expansion of Lp given by Lemma 2 (2). 
Let N = 1 be some integer to be chosen later. Let 


L™ = > Aroyno(—) 


n21 


(viewed as random variable defined on S3(q)*) and 


Ly =D Yao) 


n21 


be the smoothed partial sums of the Dirichlet series, as in Lemmas 5 and 4. 
We then write 


|E,(f(La)) — E(G¢())| < [Eg(f(Lg) — FL) |+ 
[E,(f(L)) — EG(L™))| + JEG(L™) —fW)I. 


Since f is a Lipschitz function on (D), there exists a constant C > 0 such that 
If) —FO) < Cllx — ylhoo 


for all x, y € H(D). Hence we have 


IE, (F(L,)) — E(f(L))| < CE, (I[bLg — L loo) + 
IE, ¢(L™)) — EG (L™))| + CE(IL™ = Loo). 
Fix ¢ > 0. Lemmas 5 and 4 together show that there exists some N = 1 such that 
E,(|[Ly — L™ lloo) < € 
for all g = 2 and 


E(|L — Loo) < €. 
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We fix such a value of N. By Proposition | (and composition with a continuous 
function), the random variables L®) (which are Dirichlet polynomials) converge in 
law to L“) as q—> + 00. We deduce that we have 


|E,(f(Lq)) — EF(L))| < 4e 
for all g large enough. This finishes the proof. oO 


4 Proof of Theorem 3 


For the computation of the support of the random Dirichlet series L(s), we apply a 
trick to exploit the analogous result known for the case of the Riemann zeta function. 
We denote SU> the product of copies of the unit circle indexed by primes, so an 
element (x,) of SU; is a family of matrices in SU2(C) indexed by p. 

The assumptions on D in Theorem 3° imply that there exists t such that 1/2 < 
t < 1 andr > 0 such that 


D={seEC| |s—t]<r}c{seEC | 1/2 < Re(s) < l}. 


Lemma 6 Let N be an arbitrary positive real number. The set of all series 


Tr(x oan 
> G) ; (xp) € SU2 
p>N P 
which converge in H(D) is dense in the subspace Hp(D). 

In the proof and the next, we allow ourselves the luxury of writing sometimes 


IIP(s) lloo instead of ||P] o0. 


Proof Bagchi [1, Lemma 5.2.10] proves (using results of complex analysis due to 
Bernstein, Polya, and others) that the set of series 


p>N P 
that converge in #1(D) is dense in H(D) (precisely, he proves this for VN = 1, but the 
same proof applies to any value of NV). If gp € Hpr(D) and e > 0, we can therefore 
find real numbers (6,) such that 


ee 


ip ‘ 


co «62 


y PP 


3These assumptions could be easily weakened, as has been done for Voronin’s Theorem. 
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It follows then that 


| oO z is a 
2 DN p* “Tho , 
hence (since gp € Hr(D)) that 
i6 —i6, 
erp +e"? 
- ———] < 
|e) - | <6 


p>N 


which gives the result since 


el +o = Tr( © : 
0 e 


is the trace of a matrix in SU2(C). Oo 
We will use this to prove: 


Proposition 3. The support of the law of 


log Lp(s) = — » log det(1 — X,p“*) 
P 


in H(D) is Hp(D). 


Proof Since X, € SU2(C), the function log L(s) is almost surely in the space 
Hp(D). Since the summands are independent, a well-known result concerning the 
support of random series (see, e.g., [9, Proposition B.8.7]) shows that it suffices to 
prove that the set of convergent series 


- ye log det(1 — x,p*), (Xp) € SUs, 
P 


is dense in Hp(D). Denote L(s; (xp)) this series, when it converges in 1(D). 
We can write 


ay Tr(xp) 
— > Slog det(1 — xp") = )> — + a(s; ()) 
Pp p P 
where s +> g(s; (xp)) is holomorphic in the region Re(s) > 1/2. Indeed 


8(S; (Xp)) = y log(1 +4 Lae), 
Pp 


k20 
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Fix g € Hr(D) and let ¢ > 0 be fixed. There exists N > 1 such that 


|s a Yea! + 2, Mp kp “t+28) 


. 2 (9) 


for any (Xp) € SU. Now take Xp = 1 € SU2(C) for p < N and define 


Tr(xp) 
g=ot+) —*=9+2 a — 
p<N ae 
which belongs to Hp(D). By Lemma 6, there exist x, for p > N in SU2(C) such that 


= es gils) |. <é 


p>N 


The left-hand side is the norm of 


ma 


log L(s; (xp)) — g(s; @p)) — D> 


DSN 


— gi(s) = log L(s; (xp) — PCs) — 8G (%)), 


and by (9), we obtain 


|| log L(s; %p)) — G(s) loo < 2e. 


This implies the lemma. Oo 

Using composition with the exponential function and a lemma of Hurwitz (see, 
e.g., [17, 3.45]) on zeros of limits of holomorphic functions, we see that the support 
of the limiting Dirichlet series Lp in H(D) is the union of the zero function and the 
set of functions g € Hp(D) such that g(o) > 0 fora € DAR. In particular, this 
proves Theorem 3. 


5 Generalizations 


It is clear from the proof that Bagchi’s Theorem should hold in considerable general- 
ity for any family of L-functions. Indeed, the crucial ingredients are the local spectral 
equidistribution (Proposition 1) and the first moment estimate (Proposition 2). 

The first result is a qualitative statement that is understood to be at the core of 
any definition of “family” of L-functions (this is explained in [8], but also appears, 
with a different terminology, for the families of Conrey et al. [5] and Sarnak et al. 
[15]); it is now know in many circumstances (indeed, often in quantitative form). 
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The moment estimate is typically derived from a second-moment bound, and 
is also definitely expected to hold for a reasonable family of L-functions, but it 
has only been proved in much more restricted circumstances than local spectral 
equidistribution. However, it is very often the case that one can at least prove 
(using local spectral equidistribution) a weaker statement: for some o, such that 
1/2 < o, < 1, the second moment of the L-functions satisfies the analogue of 
Proposition 2; an analogue of Bagchi’s Theorem then follows at least for compact 
discs in the region 0; < Re(s) < 1. 

As far as universality (1.e., Theorem 3) is concerned, one may expect that (using 
tricks similar to the proof of Theorem 3) only two different cases really occur, 
depending on whether the coefficients of the Z-functions are real (as in our case) 
or complex (as in the case of vertical translates of a fixed L-function). 
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The Liouville Function and the Riemann 
Hypothesis 


Michael J. Mossinghoff and Timothy S. Trudgian 


Abstract For nearly a century, mathematicians have explored connections between 
the Liouville function and the Riemann hypothesis. We describe a number of con- 
nections regarding oscillations in sums involving the Liouville function, including 
Lo(x) = Yo ,<, A(n), first studied by Polya, and Li (x) = )°-, A(n)/n, explored by 
Turan. We establish new lower bounds on the size of the oscillations in such sums. 
In particular, we prove that Lo(x) > ./x infinitely often, show that L)(x) < —1/./x 
infinitely often, and investigate sign changes in a family of functions that interpolate 
these functions, namely Ly(x) = >>. A(n)/n*, where a € [0, 1]. 


Nx 


1 Introduction 


The Liouville function A(n) is the completely multiplicative arithmetic function 
whose value is —1 at each prime, so A(n) = (—1)°™, where Q(n) is the number 
of prime factors of n, counting multiplicity. For nearly 100 years mathematicians 
have explored connections between this function and the Riemann hypothesis. 
Ultimately, these are connected by the Dirichlet series for the Liouville function: 
for }i(s) > 1, using the Euler product representation of the zeta function we obtain 


5 (i — —2s)—1 a 4k Ptr 
(2s) _ T],d—p~*) =lloweateype ay 
P 


~ — p-s)-1 ks s 
c(s) I],a P ) p k>0 P n=l u 


Here and throughout this article, s denotes a complex variable with real part o and 
imaginary part ¢. 
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For a nonnegative real number q, let Ly (x) denote the following weighted sum: 


A 
Loa (1) 


OL 


nsx 


The two special cases a = 0 anda = | have a long history in analytic number 
theory, in topics surrounding the Riemann hypothesis. 

Polya investigated the sum Lo(x) [which often appears in the literature as L(x)] 
in a 1919 publication [27]. His main result was a proof that if p > 7 is a prime with 
p = 3 mod 4, such that the quadratic imaginary number field Q(,/—p) has class 
number 1, then Lo(x) must vanish at x = (p — 3)/4. He then reported computing 
Lo(x) up to approximately 1500, and noted that it was never positive over this range, 
after the trivial value Lo(1) = 1. Polya wrote: 


I mention this observation, in order to prompt perhaps further numerical research. The proof 
of (1) [referring to Lo(n) < 0], even only for sufficiently large n, would produce a proof of 
the Riemann hypothesis, after known properties of the ¢-function and a function-theoretic 
result of Mr. Landau. 


The statement that Lo(n) < O for sufficiently large n is often described in 
the literature as Pélya’s conjecture, but in fact Polya only noted this implication, 
and did not conjecture its validity, at least in print. We therefore employ the term 
Poélya’s problem in this article for the question of sign changes, and more broadly of 
oscillations, in the function Lo(x). 

Turan studied the weighted sum L;(x) [which often appears in the literature as 
T(x)] in 1948 [31], in connection with the zeros of the truncated series for the zeta 
function, U,(s) = }°,., 1/n’. Turan proved that if there exists a positive constant 
K such that L;(x) > —K/./x for sufficiently large x, then U,(s) does not vanish 
in the half plane 0 > 1 + K/./x, which would imply the Riemann hypothesis. 
(Somewhat weaker conditions suffice as well; see [31, 32].) Turan reported that a 
coterie of Danish colleagues computed L;(x) for x < 1000, and found that this 
function remained positive throughout this range. Perhaps because of this, in later 
literature the statement that L; (x) is never negative is sometimes stated as Turdn’s 
conjecture. However, in a later paper [32] Turan took issue with the term: 


Strangely, this [referring to the positivity of Li(n)] is described in subsequent work as 
“Turan’s conjecture,” even claimed for n > 1, but this occurs nowhere in [31], forn > 1 
not even implicitly claimed. 


We use the term Turdn’s problem in this article to refer to the question of sign 
changes and oscillations in L) (x). 

Turan’s original motivation for studying this problem was settled by Montgomery 
in 1983 [22], who proved that U,(n) does have zeros some distance to the right of 
the o = 1 line in the complex plane. He proved that for every positive constant 
c< 2 —1 and every integer x > xo(c) the function U,(s) has a zero in the half plane 
o > 1+ cloglogx/ log x. 

A good deal is known about both Pélya’s and Turan’s problem. In 1958, 
Haselgrove [11] proved that both Lo(x) and L;(x) change sign infinitely often. 
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Lehman [19] first determined an explicit location where Lo(x) is positive, proving 
in 1960 that Lo(906 180359) = 1. The location of the first sign crossing of 
Lo(x) for x > 2 was not found until 1980, when Tanaka [30] established that 
this occurs at x = 906150257. In Turan’s problem, the first sign crossing was 
found more recently by Borwein, Ferguson, and the first author [5]. It occurs at 
x = 72 185 376 951 205. 

It is well known (and was certainly known to Polya and Turan) that the Riemann 
hypothesis would follow if one could bound the oscillations in Lo(x) or L; (x). The 
Riemann hypothesis, as well as the simplicity of the zeros of €(s), would follow 
if there exists a positive constant C such that any one of the following inequalities 
holds for sufficiently large x: 

Lo(x) <C. L(x) 

Vx Vx 

This follows from Landau’s theorem regarding singularities of the Laplace trans- 
form of a nonnegative function (see, for instance, [23, Lemma 15.1]). 

It is easy to see that bounding Lo(x)/./x on both sides would yield the Riemann 
hypothesis. Starting with the Dirichlet series for the Liouville function in the 
complex variable s = o + it, one may compute 


62s) An) _ Lon) —Lo(n— 1) | t .4 
C(s) ae ee nv’ =D (; ayy 


>-C, L(x) /x <C, L(x) Vx >-C. (2) 


n>1 n=l n21 
n+1 lee) 
s Lo(x) 
= tain) f Pan sf ea 
n 1 
n>1 


(3) 
so that clearly Lo(x) = O(./x) implies that the right side of (3) is analytic ono > 


1/2, and therefore so is the left side, which would establish the Riemann hypothesis. 
One can deduce that the zeros of the zeta function on the critical line are all simple 
in this case as well. 

A result of Ingham [14] in 1942 cast some doubt on the possibility of proving the 
Riemann hypothesis and the simplicity of the zeros of ¢(s) by bounding Lo(x)/ ./x or 
similar functions. Ingham proved that such a result would imply considerably more: 
it would follow that infinitely many integer relations exist among the ordinates of 
the zeros of ¢(s) on the critical line in the upper half plane. Since there seems to be 
no other particular reason to suspect the existence of such relations, and no evidence 
of such relations has been detected, it seems plausible that these (suitably scaled) 
sums involving the Liouville function may be unbounded. In this article, we focus 
on obtaining new and larger bounds on these oscillations. 

Some prior bounds on oscillations for Lo(x) are known. Anderson and Stark [2] 
proved that single values of Lo(x) could be employed to quantify the oscillations of 
Lo(x)/./x. They showed that 


Lo(x) _ Lo(xo) — 1) 
lim sup = , 


X00 f/x /Xo 


(4) 
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where /(x) satisfies |J(x) — 1] < eee =~ 6.437/,/x. Lehman [19] reported 


that Lo(906 400000) = 708, whence by (4), Lo(x) > 0.0234833./x infinitely 
often. Tanaka [30] found that Lo(x) reaches a maximum value of 829 for x < 
10°, and while he did not report that this value occurred at x = 906316571, 
with this datum (4) shows that Lo(x) > 0.0275036./x infinitely often. The 
more recent work of Borwein, Ferguson, and the first author [5] determined that 
Lo(351 753 358 289 465) = 1,160,327, which establishes that 


Lo(x) > 0.0618672./x 


infinitely often. Further, as noted by Humphries [13], the method of Anderson 
and Stark also shows that lim inf, Lo(x)/./x < (Lo(x0) — I(x0))/./x0 for any 
particular xo. The data point Lo(72 204113780255) = —11805117 from [5] 
establishes that 


Lo(x) < —1.389278./x 


infinitely often. 

In this article, we prove even larger oscillations in Lo(x). Due to some symmetry 
in these problems, our results apply equally to L) (x), after suitable scaling. We prove 
the following result in Sect. 6. 


Theorem 1 Each of the following inequalities is satisfied for infinitely many 
integers x: 


Lo(x) > 1.0028. Vx, Lo (x) < —2.3723/x, 
L(x) < -1.0028//x, Li (x) > 2.3723/ Vx. 


The more general family Ly (x) with a € [0, 1] was studied by the authors [24], 
by Humphries [13], and by Akbary et al. [1]. It will be seen in Sect. 2 that if Ly (x) 
eventually has no sign changes for some a € [0, 1/2), or if Ly (x) — €(2a)/f(a) 
eventually has no sign changes for some a € (1/2, 1], then the Riemann hypothesis 
follows, as well as the simplicity of the zeros, and the existence of infinitely many 
integer relations among the ordinates of the zeros of the zeta function in the upper 
half plane. Because of this, in [24] the authors raised the question of whether these 
functions do change sign infinitely often. Our methods in this article allow us to 
resolve this question for almost 60% of the interval [0, 1]. We establish the following 
theorem in Sect. 6. 


Theorem 2 For each a € [0,0.29714...), the function Ly(x) exhibits infinitely 
many sign changes, and the same conclusion holds for each a € (0.70285..., 1] 
for the function Ly (x) — €(2a)/6 (a). 

Some additional explicit results regarding the oscillations of L, (x) for particular 
values of a appear in Sect. 6 (see Theorem 8). 
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This article is organized in the following way. Section 2 describes a normalization 
of the functions Ly(x) that allows us to treat their oscillations in a uniform 
way. Section 3 is an interlude on sums involving the Md6bius function. Section 4 
reports on a method involving the so-called weak independence of a set of real 
numbers, which quantifies oscillations in sums of arithmetic functions. Section 5 
then describes a more recent strategy developed by Best and the second author [4] 
to obtain improved bounds on oscillations for sums involving the Mobius function. 
Section 6 describes an adaptation of this method to sums involving the Liouville 
function, and establishes Theorems | and 2. Finally, Sect.7 briefly discusses the 
potential for future improvements. 


2 Generalizing the Problems of Pélya and Turan 


We begin by defining some functions that will allow a more uniform treatment of 
oscillation questions for the functions L,(x) with a ¢€ [0, 1]. In [24], it is shown 
that the Riemann hypothesis is true if and only if Ly (x) converges to €(2a)/C(@) for 
each a € (1/2, 1]. In fact, convergence at a particular w in this range is tantamount 
to showing that the zeta function has no zeros in the half plane o > aq, so in 
particular the fact that L;(x) converges to 0 is equivalent to the Prime Number 
Theorem. Moreover, it is shown in [24] that the Riemann hypothesis produces an 
error bound of O(x2-@+¢) for the quantity Ly (x) — ¢(2a)/€(a). A similar statement 
is shown to be equivalent to the Riemann hypothesis for the case of small a: for each 
a € [0, 1/2), this conjecture is equivalent to the statement that Ly (x) = O(x2-2+¢), 
We describe the special case a = 1/2 shortly, but following [24] we define 


La(x), if0<a<1/20ra=1, 
log x . _ 
Bier aay MEO (5) 
Ly(x) — ee if 1/2 <a <1, 
and we let 
Ag(u) = Zales, (6) 


Here, we have simply scaled (x) by the main term of the error bound (on the 
Riemann hypothesis), and substituted e” for x for convenience. We may now state 
the result from [24] that generalizes (2). 


Theorem 3 Suppose a € [0,1] is a fixed real number. If there exists a constant C 
such that Ay(u) — C has constant sign for all sufficiently large u, then the Riemann 
hypothesis follows, and all of the zeros of the zeta function are simple. 
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It is therefore natural to study oscillations in the family of functions Ag (u) with 
0 < a < 1. We exhibit some of these functions in Fig. 1. Parts (a) and (b) of this 
figure show L1/4(e") and Aj /4(u), respectively, over the range 22 < u < 12log10 + 
27.6. The horizontal line in part (b) shows the center of the oscillations of A,/4(u). 
We shall see in (9) below that this line occurs at y = 2/€(1/2) = —1.3695.... 
The plot for the corresponding function in Pélya’s problem, Ao(u), exhibits similar 
oscillations, but they are centered at y = 1/€(1/2) = —0.6847.... Thus, a sign 
change in Aj/4(u) requires an upward oscillation that is twice as high as what is 
required for a sign change in Pélya’s problem. 

Figure 1 parts (c) and (d) illustrate the transitional phase, a = 1/2: in (c), the 
oscillations center on the line u/2¢(1/2) — y/f(1/2) + €/(1/2)/2¢(1/2)7, as we 
shall see in (9). Last, parts (e) and (f) exhibit L3/4(e") and A3/4(u), respectively, 
and the positive bias with respect to the limit point is evident here. In part (e), the 
horizontal line shows the expected limiting value, ¢(3/2)/¢(3/4) = —0.7591..., 
and in part (f), the horizontal line shows the center of the oscillations, which in this 
case occurs at —2/€(1/2) = 1.3695..., again using (9) below. In Turén’s problem, 
the oscillations in A; (u) would center on half this value. 

In order to analyze the oscillations in the functions Ag(u), we require some 
additional definitions. Let F,,(s) denote the Laplace transform of Ag (u), 


Fy(s) = [ Aetne du. (7) 
0 


It is shown in [24] that F,(s) converges unconditionally in the half plane o > 1/2 
and converges in o0 > 0 on the Riemann hypothesis. The following formula is 
derived there: 


ful), if0 <a <1/20ra =1, 
Fu() = 1*© ~ sea yDe” ne (8) 
fa(s) — §Qa) if 1/2 <a <1, 


C(a)(s —a@ + 1/2)’ 
where 


€(. + 2s) 


Jal) = Tray 1/DEs + 1/D)" 


Notice that the term subtracted from fy (s) to create F,(s) eliminates the simple pole 
on the real axis ato = €(2a)/€(a) when a e€ (1/2, 1), and removes the second- 
order pole ato = 0 when a = 1/2. This is the source of the additional term in (5) 
for the case a = 1/2: we must ensure Fy(s) has only simple zeros in o > 0 in order 
to apply Ingham’s method. 

At this point, and in the remainder of this article, we may assume the Riemann 
hypothesis and the simplicity of the zeros of the zeta function. This is permissible 
because it follows from Landau’s theorem that if either the Riemann hypothesis is 
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- 1000 


eT 


- 1500 


22 23 #24 2 2 27 22 23 24 25 26 27 
(e) (f) 


Fig. 1 L,(e") and Ay(u) fora = 1/4, 1/2, and 3/4. (a) Lyja(e"). (b) Arja(u). (©) Li2(e"). 
(d) Aj /2(u). (e) L3/4(e"). (2) A3j4(u) 


false or €(s) has a multiple zero on the critical line, then lim sup,_,,, Aa(u) = 00 
and lim inf,+9 Ag (uw) = —oo (see [9, Sect. 8], [14, p. 318], and [13, Theorems 2.6, 
2.7]). With this, we then have that F,(s) is analytic ino > 0, except for a number of 
simple poles on the imaginary axis. We need to record the residues of these poles. 
At the origin, we find 


1 
ee if a € [0, 1/2) U(1/2, I], 
Res(F,,0) = 1 1 @SG/2) (9) 
9 ifa = 1/2 
(1/2) 2¢(1/2)?’ ~~ 
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where y denotes Euler’s constant, and if p, = 1/2+iy, is a zero of the zeta function 
on the critical line, then 


Res(Fy, iY’n) = a) gee ei: (10) 


(Pn > a)’ (Pn) , = ~ 
Ingham’s method now applies. Select a positive real number T, and let y;, ..., 
yn denote the ordinates of the zeros of the zeta function in the upper half plane with 
height at most T on the critical line. Let 


N 
Res(Fy, 0 Res(Fy, (Vn 
( ae 3 (Fu, i¥n) 


Fa(s) = s—ly, 


|n|=1 


so that F,(s)—F° (s) is analytic in the region o > 0,t € [—T, T]. Let k(t) be an even 
function supported on [—T, T] with k7(0) = 1, so that k7(f) is the Fourier transform 
of a nonnegative function K;(y). Let B%(u) denote the inverse Laplace transform of 
F*(s), except that each term is modified by a value of the weight function kr(f): 


Be (u) = Res(Fa,0) +29 S > kr(yn) Res(Fa, iyn)el™. (1) 


0<y,<T 


Ingham’s method then establishes the following inequalities. A proof may be found 
in [24]. 


Theorem 4 Let a € [0,1] and T > 1 be fixed real numbers. Then, using the 
notation above, 


lim inf Ay (uv) < lim inf B*(u) < BY (v) < limsup BF (u) < lim sup Ag(u), 
u>oo uo u>oo u>oo 


for any positive real number v. 

Thus, to establish that the functions A,(u) oscillate to some large degree, it 
is enough to show that the function B*¥(v) achieves particularly large values, and 
achieves particularly small values. This is a much more attractive setting because 
B* (v) is a finite sum. Haselgrove [11] used this approach to establish that infinitely 
many sign changes occur in both Pélya’s and Turan’s problem. Using T = 1000, he 
computed that Bj (831.846) is positive, and that BY (853.853) is negative. Lehman 
[19] and Borwein et al. [5] also employed these functions to determine locations 
where Lo(x) or L; (x) appeared likely to change sign. 

We remark that the term corresponding to the first zero of the zeta function on 
the critical line, y) = 14.1347..., drives the overall periodic structure of A,(u). 
Figure 2 illustrates this with a plot of A;/4(u) as in Fig. 1b overlaid with just the first 
two terms of (11) (without the weight function), 2/¢(1/2)+22(Res(F1/4, iy; )e”""). 

We may now describe Ingham’s result in a more general setting. This argument 
has two principal ingredients. First, using methods dating to Titchmarsh, one shows 
that the sum of the moduli of the residues of Fy(s) at its poles on the positive 
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Fig. 2 A1/4(u) and its 

approximation using only the 
constant term and the single 

periodic term corresponding -0.5 
to the first zero of the zeta 

function on the critical line 


-2.0 


imaginary axis diverges: )~,,., [Res(Fa.iY¥n)| = oo. Second, if the ordinates of 
the zeros of the zeta function in the upper half plane are linearly independent, or 
if there are only finitely many linear dependencies, then by Kronecker’s theorem 
on simultaneous approximation there exist infinitely many choices for v so that the 
complex argument of Res(Fy, iy,)e'”"” is less than an arbitrarily selected « > 0 
in absolute value, for each y, € (0,7). Similarly, there would also exist infinitely 
many choices of v so that the argument of each of these terms is within € of z. 
Thus, on independence of the y,,, by using (11) and the divergence of the sum of the 
residues we see that B* (v) must achieve values that are as large as we like, in either 
the positive or the negative direction. It follows that A,(u) would have unbounded 
oscillations. We state this below; a proof may be found in [24]. 


Theorem 5 Leta é€ [0,1]. If the imaginary parts yy < y2 < --- of the distinct 
zeros of €(s) lying above the real axis satisfy at most a finite number of linear 
relations of the type )~=, CnYn = 9, where the c, are integers, not all zero, then 
lim inf,,99 Ag (u) = —oo and lim sup,,_, 55 Ag (u) = ©. 


3 The Mobius Function and the Riemann Hypothesis 


The Mobius function, defined by x(n) = A(n) if n is squarefree and z(n) = 0 
otherwise, also has a long history in analytic number theory in connection with 
the Riemann hypothesis. We remark here on a few analogues of the statements 
described earlier for the Liouville function. 

For a real number a € [0, 1], we may define 


[(n) 
M,(x) = : 12 
(x) dX = (12) 
The cases a = 0 anda = | have attracted much interest for at least 130 years. 


Stieltjes [29] noted the importance of the function M(x) in his letter to Hermite 
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in 1885. Von Mangoldt [33] proved that lim,—+99 Mj (x) = 0 in 1897, and Landau 
established another proof of this fact in his 1899 dissertation [17]. A few years later, 
Landau [18] noted that the Prime Number Theorem was equivalent to the statement 
that M(x) — 0 as x — oo, in that each of these statements can be derived from the 
other by using only elementary means. 

The function Mo(x) (known in the literature as the Mertens function, and often 
denoted by M(x)) has a particularly interesting history in analytic and computational 
number theory. It has long been known that the Riemann hypothesis and the 
simplicity of the zeros of ¢(s) would follow from a bound on the oscillations of 
Mo(x), for example, the argument shown in (3) works just as well for the Dirichlet 
series for the MGbius function, }°., u(n)/n* = 1/€(s). In fact, Landau’s theorem 
shows that these statements follow from any of the bounds analogous to those shown 
in (2), for example, it would be enough to show the existence of a constant C 
such that Mo(x)/./x < C for all sufficiently large x. Motivated by some of these 
connections, Mertens computed Mo(x) for x < 10000 and conjectured that the 
inequality Mo(x) < ./x for all x > 1 was “very probable.” This Mertens conjecture 
remained open until 1985, when it was disproved by Odlyzko and te Riele [25]. 

Ingham’s article [14] treated the Mébius function as well, and showed that a 
bound of the form Mo(x)/./x < C, or Mo(x)/./x > —C, for some fixed positive 
constant C and all sufficiently large x, would also imply the existence of infinitely 
many integer linear relations among the ordinates of the zeros of the zeta function 
in the upper half plane. In this case, the function corresponding to Fo(s) from (8) is 
F(s) = 1/(s + 1/2)¢(s + 1/2). This has no pole at the origin, and the residue at iy, 
where p = 1/2 + iy is a zero of €(s), is Res(F, iy) = 1/pf'(p). The lack of pole 
at the origin for F(s) produces qualitatively different behavior for the Mobius sums, 
compared to the functions Ly (x): the sums M, (x) exhibit no sign bias. 

We remark that Odlyzko and te Riele’s disproof of the Mertens conjecture 
employed the analog of Theorem 4 for Mo(x). They showed that the sum 


kr(yn)evn 


BO) = 2B De ankaAY 


0<y)<T 


is larger than 1 infinitely often, and smaller than —1 infinitely often. They showed 
this by constructing values for u so that the most significant terms in this sum lined 
up nearly in the direction of the positive real axis, or nearly in the direction of the 
negative real axis. Their method employed the LLL lattice reduction algorithm [20] 
to solve a simultaneous approximation problem using the 70 most significant zeros 
among the first 2000 in the upper half plane. 

In 2015, Best and the second author [4] also employed the LLL algorithm, but in 
a different fashion, to improve bounds on oscillations in the Mertens problem. They 
proved that Mo(x)/./x > 1.6383 infinitely often, and that Mo(x)/./x < —1.6383 
infinitely often. To describe this method, we must first discuss the notion of weak 
independence of a set of real numbers, and an approach to the oscillation problem 
that employs this idea. 
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We return to our study of the normalized Liouville sums Ag(u). From Ingham’s 
method, it is known that arbitrarily large oscillations for Ag (u), for each a € [0, 1], 
would follow from the linear independence of the ordinates of the zeros of the 
zeta function in the upper half plane. Beginning in the early 1970s, a number 
of researchers sought to quantify connections between oscillations in sums of 
arithmetic functions and the linear independence problem. A key idea is the notion 
of weak independence of a set of real numbers: a set is N-independent if no 
linear relations besides the trivial one exist involving its members when the integer 
coefficients of the relations cannot exceed N in absolute value. 

Since true independence implies infinite oscillations, it is perhaps not surprising 
that weak independence might be employed to establish large, though finite, 
oscillations. Conversely, a bound on the oscillations should perhaps lead to the 
existence of linear relations among the zeros that employ only small integer 
coefficients. This latter point of view was developed by Bateman et al. in 1971 
[3]. They proved that the Mertens conjecture, which of course asserts a very strong 
bound on the oscillations of Mo(x), implies the existence of infinitely many linear 
relations among the zeros of the ordinates of the zeta function in the upper half 
plane in which only the coefficients +1 and 2 appear, with 2 appearing at most once 
per relation. Diamond [7], Grosswald [10], and Anderson and Stark [2] develop 
this notion in a more general context. Here, we essentially follow the treatment of 
Anderson and Stark, as it allows some additional freedom in its application. 

Let I denote a set of positive real numbers, and let I’’ denote a subset of 9 
[0, T], for a real parameter T. We say I”’ is N-independent in I’ /\ [0,T] if both of 
the following conditions hold. 


qi) If yer’ cyy = 0 with each c, € Z and each [ey < N, then each c, = 0. 
(i) If per” cyy = y* for some y* € I [0,7] with each c, € Z and each 
|c,| < N, then y* € I’, cy* = 1, and every other c, = 0. 


That is, no nontrivial integer relations among the elements of I’’ with bounded 
coefficients exist, and one cannot manufacture an element of J” with height at most 
T by using an integer relation among the elements of I’’ with bounded coefficients. 

Suppose g(u) is a piecewise continuous real function which is bounded on finite 
intervals, and let G(s) denote its Laplace transform, 


G(s) = [PO scoe du. 


Suppose G(s) is absolutely convergent in the half plane o > oo, and can be 
analytically continued back to o > 0, except for simple poles on the imaginary axis 
occurring at tiy for y € I”, where I" is a set of positive real numbers I” = {y,}, 
and possibly at 0 as well. Anderson and Stark established essentially the following 
theorem. 
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Theorem 6 Let I’ Cc I, and suppose I' is N-independent in I £N {0, T]. Then, 
using the notation above, 


lim inf g(u) < Res(G, 0) — > kr(y) |Res(G, iy)| (13) 
u—>0o N+ N+1 
yer’ 
and 
2N 
lim sup g(u) > Res(G,0) + Nal > kr(y) |Res(G, iy)|, (14) 
u—>Oo 
yer’ 


where kr(t) is an admissible weight function. 

Here, we recall that an admissible weight function is nonnegative, even, is 
supported on [—7, 7], takes the value | at t = 0, and is the Fourier transform of a 
nonnegative function. We include the proof of this theorem from [2] for the reader’s 
convenience. 


Proof For each y € I’, let 6, = arg(Res(G, iy)) and let a, = |Res(G, iy)| kr(y). 
As in (11), define 


Bz(u) = Res(G,0)+ 5° a, (exp(i(yu + 6,)) + exp(—i(yu + 8,))) . 
yer nfo,7] 


For a positive real number U, certainly 


1 i(yu+ by i(yut+ 06 2N 
af, BF(u) I] («x (“~ 7) exp (Se 5 »)) du 


yer’ 


u+ Oy, i(yut+ 6 2N 
= (sua) sf 1 (co (OUE) + oo (HO) )" a 


Uyer 


Expanding the binomial in each term of the product on the right produces 
the constant term ie ) in each factor. Expanding the product then produces a 
trigonometric polynomial where the argument of each exponential function is a 
linear combination of the elements of I’’ where the coefficients are bounded by 
N. Hence, by the first property of N-independence for I”’, the constant term of this 
polynomial is ( | 
ie 

N 

B;(u) produces Res(G, 0) en Terms involving y € ’\I"' from BF(u) contribute 
nothing, since there is no constant term here, due to the second property of N- 
independence of I’ in I 1 [0, 7]. Finally, each of the terms involving a y € I’ 


r'|- 
from the sum in B7;(u) contributes (a, i ) + a, (2) ea ve to this limit. 


. The limiting value of the right side as U — oo is therefore 


ae | Expanding the left side and taking the limit, certainly the constant term of 


Therefore, 


Ir" \r\-1 Ir" 
2N 2n \ (2N 2N 
Res(G, 0) +2 ays sup B(U), 
N N + 1 N yer’ N UeRtT 


The Liouville Function and the Riemann Hypothesis 213 
so 


2N 
sup B7(U) > Res(G,0) + —— 
UeRt N+1 


>> kr(y) [ResG, iy)]. 


yer’ 


The inequality (14) then follows from an analogue of Theorem 4 for the function 
g(u), and the statement (13) then follows by replacing g(u) by —g(u) in this 
argument. 

We remark that Anderson and Stark prove a more general statement, in which 
each coefficient c, in a posited linear relation is governed by its own bound of the 
form |c,| < N,. The conclusion in this more general statement is quite similar— 
the rational number NV/(N + 1) in inequalities (13) and (14) is moved inside each 
respective sum and replaced by N,,/(N, + 1). 


5 Lattices and Improved Oscillation Bounds 


Theorem 6 and its relatives show that in principle an algorithm exists for establishing 
large oscillations, since verifying that no linear relations with bounded coefficients 
exist among a finite set of real numbers is certainly a finite computation. However, 
such a strategy is computationally prohibitive in cases of interest, even if a modest 
bound on N is chosen. Recently, Best and the second author [4] determined a way 
to employ this result of Anderson and Stark in a computationally tractable way, and 
used it to establish their improved bounds in the Mertens problem cited in Sect. 3. 
The main idea is to create a number of integer lattices using a set I’’ of selected zeros 
of the zeta function. These lattices have the property that if I’ is N-dependent in 
I’ N[0, T], then one of them must contain a short nonzero vector. They then establish 
that none of these lattices can possess such a vector, and apply Theorem 6 using a 
computed value of N. We apply this method to establish bounds on oscillations 
for the functions A,(u), including the improved bounds listed in Theorem 1. Our 
algorithmic strategy differs from that of [4] in some significant ways that we also 
detail below. 

Let I" denote the set of ordinates of the zeros of the zeta function in the upper 
half plane. Fix a value for a € [0, 1]. Let T be a positive real parameter, and label the 
elements of IM [0, T] as y1 < yo <--- < Ym. Next, we must select an admissible 
weight function kr(t). Recall that this function must be even and supported on 
[-T,T], satisfy k7(0) = 1, and must be the Fourier transform of a nonnegative 
symmetric function Kr(y). Let n be an integer parameter, with 0 < n < m, and let 
I’ = I''(a,n,T,kr) denote the subset of size n of IM [0, T] consisting of those 
values of this set for which the quantity 


an 5 (2p) 
kr(y) [Res(Fa, iy)| = kr(y) pomral 
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is largest. Label the elements of I”’ by yg, < +++ < Yq,- Next, let b be another integer 
parameter, which governs the precision of our calculations. We construct m—n+ 1 
lattices, labeled Ap, ..., Ajm—n. The lattice Ag has dimension n and lies in R”*!, 
with a basis given by the column vectors of the (n + 1) x n dimensional matrix 
formed by appending the row (| 2° Va | pity La? Va, |) to the bottom of the identity 
matrix /,, where |x] denotes the integer nearest x. For 1 < i < m —n, the lattice 
A; is constructed in a similar way, by grafting (|2 Yar | oe [2? Van | f |2°y*]) onto 
T,41, where y* is a particular value in (7M [0, T])\y’. 

Suppose that I’ were N-dependent in I’M [0,7]. Then one of the following 
conditions would have to hold: 


@® Lyer:cyy = 0, each |c,| < N, not all cy = 0. 
(ii) Dy yer? cyy = y*, each |c,| < N,y* € I’, and it is not the case that c,* = 1 
and every other c, = 0. 
Gil) Dyer yy = y", each |cy| < N,y* € (FP 0[0, T)\r’. 


If case (1) were to occur, then v = (cj, C2,...,Cn, pee Cj | 2° va; |) is a nonzero 


vector in Ag (writing c; for c,,,), and we compute 


n 


jv? = Ye + (> ci (2? ya; | - 7) 


i=1 


n 2 2 
< nN? + (> :) < (5 + n) N?. 


i=1 


(15) 


In the same way, case (ii) would imply the existence of v # 0 in Ao satisfying 


lv? < ud Ne4 (242) 42> (16) 
~\4 2 4’ 
and case (iii) would allow one to deduce the existence of v 4 0 ina lattice Aj, for 
some positive index i, where 


2 N 5 
pes (4 +n)" +3. (17) 


It is enough then to show that none of these lattices contains a short nonzero vector. 
In particular, we need to show that Ag has no nonzero vector satisfying (16), and 
no A; with 1 < i < m—nhas a nontrivial vector satisfying (17). To establish this, 
we employ the following property of real lattices. For the reader’s convenience we 
include its proof, adapted from [6, p. 51]. 


Lemma 1 Suppose {v1,..., Un} are linearly independent real vectors, let A denote 
the Z-lattice spanned by this set, and let {w,,...,Wn} denote its Gram—Schmidt 
orthogonalization. If v is a nonzero vector in A, then |v| > min{|wi|,..., |Wal}. 
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Proof Suppose v is a nontrivial vector in A, and write v = )°*_, a;v; where 
each a; € Z. Let k denote the largest index where a, # 0 in this sum. By the 
Gram-—Schmidt process, we can represent each vector v; as a linear combination of 
WI1,-++5 Wis 


k i k k k-1 [ k 
=> aj ) rw = ) ) AiVig | Wi = Ak KRW + ) ) airy | Wj, 
i=l j=l j=l \ is J=1 \ i 


where each rj € IR. Since ry, = 1 owing to the Gram—Schmidt process, and because 
W1,.--, We are orthogonal, we obtain 


k-1 k 

2 

lv? = ag lw? + 2 | Dairy | [vel 
j=l \isy 


and we conclude that |v|? > |w,|* > minf{|w,|?,..., |wal7}. 

Using Lemma | on the bases we constructed for our lattices Ag, ..., Am—n would 
produce extremely poor lower bounds on the size of a nontrivial vector in these 
lattices, since the given basis vectors of these lattices all point roughly in the same 
direction. However, we are free to apply this lemma to any basis of the lattice, and 
we will certainly obtain much better results if the basis we employ is in some sense 
more orthogonal. We can create such a basis by preprocessing our basis with the 
LLL lattice reduction algorithm [20]. Given a basis for an n-dimensional lattice A, 
this algorithm produces a reduced basis {v,...,V,} having the property that 


n 4 n(n—1)/4 
aasTmls(g) 4. 


i=1 


where 6 is a free parameter in (1/4, 1). The lower bound is simply Hadamard’s 
inequality; the meaningful part here is the upper bound, which can be interpreted 
as a guarantee that the basis vectors produced are in a sense more orthogonal. The 
quality of the basis, as well as the running time of the algorithm, depends on the 
value of 6: larger values produce a better basis at the expense of longer computation 
times. The algorithm is guaranteed to run in polynomial time provided 6 is selected 
in this range. 


6 Computations 


We must select a number of parameters in order to apply this method to the functions 
Aq(u). First, we select a € [0,1], but noting that |Res(Fy,iy)| = |Res(Fi-a,iy)|, 
we need only consider a € [0,1/2]. Next, we must select a weight function 
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k(t). Anderson and Stark, following Ingham, employed the Fejér kernel, where 
kr(t) = 1—|t|/T for t € [—T, T], but after some experimentation we find that the 
kernel introduced by Jurkat and Peyerimhoff [16] and Odlyzko (see [26]) produces 
better results. This kernel, which was also employed in the disproof of the Mertens 
conjecture [25] as well as the recent improvements in this problem [4], is defined by 


kr(t) = 


(1 — |t| /T) cos(at/T) + sin(a |t|/T)/a, |t| < T (18) 
0 


|t| > T. 


Our experiments with the choice of kernel consisted of evaluating the limiting 
values of the bounds in (13) and (14) as N — oo using these two possibilities 
for kr(t), for a number of sets I’ obtained from various choices of T and n. We 
found that the function (18) consistently produced better results, just as Best and the 
second author concluded in [4, Fig. 2]. It is possible that another choice of kernel 
could produce better bounds, but Odlyzko and te Riele [25, Sect. 4.1] indicate that 
potential improvements from such a change are rather limited. 

With our kernel selected, we have four additional parameters to choose for our 
computation: 


¢ T, the height on the critical line for the zeros of ¢(s), or equivalently m, the 
number of zeros in J” M [0, 7]. 

¢ n, the size of the set I”’, and the dimension of the lattice Ag. 

¢ b, the number of bits of precision for computing the zeros of €(s). 

¢ 6, the parameter governing the quality of the basis produced by LLL, as well as 
its run time. 


In the main computation of [4], the authors selected n to be rather large, at 500, 
and b very large as well, at nearly 30,000. This makes for a time-consuming lattice 
reduction step, but they compensated in part for this by selecting a modest value for 
6 at 0.3, and picking m somewhat small at 2000. This computation employed 1501 
lattices, each of dimension 500 or 501, and produced a value of N close to 5000. 

We take a different approach in this computation. We wish to show that Ag(u) > 1 
infinitely often: this can be computed with a number of different choices of the 
parameters. If we select n to be small, then we need to check many zeros of f(s) 
before finding enough with sufficiently large residues, or we need to select T large 
so that the weight function is more generous at the zeros we pick. In any event this 
strategy produces many lattices to analyze. Choosing smaller values for n would 
also encourage selecting a larger 6, which would produce a better result. On the 
other hand, if we pick n to be large, then we will not need as many zeros, so we will 
have fewer lattices to reduce, but each one will require more time, and we may need 
to pick 6 smaller. 

The LLL algorithm has worst-case running time O(n°b*), but often performs bet- 
ter in practice. Through experiments, we found that b needed to grow approximately 
linearly in n, and some empirical trials using b = ©(n) found that the total run time 
required by running LLL followed by Gram—Schmidt grew approximately as n‘*. 
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Table 1 Timing estimates ii Wn b C Time 
(in core-days) for establishing SS 
1.0033 | 417 


that Ag(u) > 1 infinitely often en0 | ee Ene 
250 | 3700 | 7000 | 1.0034 | 376 


260 | 3000 | 7500 | 1.0023 | 407 
270 | 2600 | 8000 | 1.0023 | 435 
280 | 2400 | 8500 | 1.0056 | 471 
290 | 2200 | 9000 | 1.0065 | 476 
300 | 2000 | 9500 | 1.0042 | 480 


1-89, 91-94, 96-98, 100-108, 111, 113-119, 121, 123-130, 132, 134-136, 138, 140-142, 145— 
147, 149, 151-153, 156-160, 163, 164, 169, 171, 172, 174, 175, 178, 180-182, 186, 187, 189, 192, 
193, 197, 199, 204, 205, 208, 210, 212, 213, 215, 216, 222-224, 227, 229, 231, 233, 235, 241, 243, 
247, 253-255, 259, 260, 263, 266, 272, 278, 279, 282, 285, 291, 298, 299, 310, 311, 316, 317, 323, 
324, 327, 330, 331, 337, 343, 344, 350, 356, 363, 364, 369, 377, 390, 398, 403, 407, 410, 424, 436, 
437, 451, 452, 472, 481, 485, 486, 493, 507, 508, 514, 521, 543, 556, 557, 579, 606, 607, 629, 643, 
680, 693, 694, 716, 717, 769, 807, 860, 922, 923, 1278. 


Fig. 3 Indices of the 250 zeros of ¢(s) employed in the calculations 


Using this empirical value, we could estimate the time required to obtain our desired 
result in Pélya’s problem. Some estimates are recorded in Table |. For a number of 
choices of n, this table lists the number of zeros m required to obtain the desired 
result, then our estimate for the number of bits of precision b required in order 
to obtain a value for N near 3000, followed by the best possible result C we could 
obtain (that is, without the attenuating factor of V/(N+1)), and finally the estimated 
time required to complete this calculation, in core-days on a typical workstation. 
These estimates assumed 6 = 0.99. Our results led us to choose n = 250. Some 
additional experiments indicated that we could likely trim our value of b to 6600, 
so our final choice of parameters was n = 250, m = 3700 (and T = y379; — €), 
b = 6600, and 6 = 0.99. 

The indices a; of the n = 250 zeros selected for this computation are shown in 
Fig. 3. The numerical values of these zeros were computed to high precision using 
the mpmath Python library [15]. This package was also used to compute the values 
of the zeta function and its derivative that were needed for the calculation of the 
residues in (10). We constructed the 3451 matrices for the lattices A;, and then used 
SageMath [28] to run each of these through LLL followed by Gram—Schmidt. After 
this, inequalities (16) and (17) were used to determine a lower bound N; on the 
length of a nonzero vector in the lattice Aj. 

These computations were performed on two clusters, one housed in the Research 
Computing Group at Simon Fraser University, and another that is part of the 
WestGrid computing consortium of western Canada. The smallest N; reported from 
these jobs was 2527; the mean value was 3720. With only 62 of these matrices 
producing a bound below 3100, we checked these cases a second time at a higher 
precision, choosing b = 6950. This raised the value for each of these cases to at 
least 6702, so we may use N = 3100 as our value for weak independence. We 
therefore obtain the following result. 
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Theorem 7 The set I’! = {yq, : 1 < i < 250}, where the sequence a; is shown in 
Fig. 3, is 3100-independent in y M [0, T], with T = y379, — 107!°. 

Using this result in Theorem 6 with G = Fp and kr(t) given by (18) immediately 
produces Theorem 1. 

We may now use Theorem 7 to deduce bounds on the oscillations of other sums 
involving the Liouville function. By choosing G = F, in Theorem 6 with a = 1/4, 
1/2, or 3/4, employing formulas for the residues shown in (9) and (10), using the 
kernel (18) again, and translating the results for the functions A, (u) back to the sums 
L,(x) by using (5) and (6), we obtain the following theorem. 


Theorem 8 Each of the following inequalities is satisfied for infinitely many 
integers x. 


pa < —3.0573x"/4, Lyj4(x) > ee (19) 
Lip) — aE <=-1,1634, Li) — ea 9912). (20) 
£(3/2) 0.31824 (3/2) 3.0573 
La) — Fajay <— ge + 86) — Fray > ar 


It may seem that the inequalities in Theorem 8 could be improved by tuning our 
calculation for the specific cases a = 1/2 ora = 3/4. After all, the residues in 
(10) exhibit a dependence on q, so it seems possible that a different set ’’ would be 
optimal for these choices. However, with the same choices for n and m, we find that 
the optimal set of indices for I’ for these cases is precisely the list shown in Fig. 3. 
Thus, the results of Theorem 8 cannot be improved with another calculation of the 
same size by using this method. 

The same method allows us to establish Theorem 2. Figure 4 shows the lower 
bound on lim sup,_,49 Aa(u), as a function of a, that we obtain from Theorem 6 
by using the independence result of Theorem 7. This graph has a root when 
1/€(1/2)(1—2a) = 1.6878471...,so when a = 0.297148 ..., and this establishes 
the first statement in the theorem. The statement that lim inf,,.¢9 Ay (u) is infinitely 
often positive for a > 0.702851... then follows by symmetry. 

As a is taken to be closer to 1/2 it is harder to exhibit an infinitude of sign 
changes. This is not surprising since then the term Res(F,, 0) in (9) is larger and 
hence exerts greater dominance in (11). Indeed Humphries [13, Theorems 1.5, 1.6] 
has proved some conditional results about the biases in Ly(x) when @ tends to 1/2. 


7 Improvements 


We close by briefly noting some prospects for further improvements in these 
oscillation bounds. As noted after the proof of Theorem 6, one could choose a 
bound on c; that depends on i. If, for example, |c;| < N( — i/N), then the 
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0.05 0.10 0.15 0.20 0.25 0.30 


Fig. 4 Lower bound for lim sup,_, ,5 Au (u), for0 < a < 0.3 


sums (>-_, c//2)? in (15)-(17) would have a main term of (Nn)?/16 instead of 
(Nn)?/4.. This is a win; we now must balance this against the loss of attenuating the 
contribution from large zeros. Although these contributions are small, and hence we 
expect an overall win with this strategy, we have not pursued this further. 

We note also the qualitative gains that come from using the weak linear 
independence method. Prior to this method, one exhibited large oscillations in a 
function like Lo(x) by searching for a single large value Lo(xo)—cf. (4). To find an 
even larger oscillation, one needs to search for a larger spike in Lo(x). There is no 
guarantee that such a spike will be found by extending the computational run time 
modestly. Indeed, given that Lo(x) also obtains large negative values, one may have 
to extend the run time considerably to witness the next spike. Using this approach, it 
is difficult to exhibit large oscillations in Lo(x), because Lo(x) has large oscillations! 

In contrast, if one increases the size of the matrices in our method, or accounts 
for more zeros, then one is guaranteed a better bound in the oscillations. However, 
in the absence of any new idea, we believe such improvements will be small 
without radically larger computation times. To illustrate the marginal nature of this 
improvement, we note that Gonek [8] and Hejhal [12] independently conjectured 
that ocyer \¢’(p)|-! x Tog T)!/*. Littlewood [21] showed that (log log t)"! « 
¢(1 + it) « loglogt, whence the behavior of ¢(2p) should be fairly benign. 
Therefore, if the Gonek and Hejhal conjecture were true it would follow from partial 
summation that 


€(2pn) 


< (log N)*4. 
Pn! (Pn) 


», 


O0<yn<N 


Therefore, to increase our sums, and hence to improve on our bounds on the 
oscillations, we would require substantially many more zeros. 
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Explorations in the Theory of Partition Zeta 
Functions 


Ken Ono, Larry Rolen, and Robert Schneider 


Abstract We introduce and survey results on two families of zeta functions 
connected to the multiplicative and additive theories of integer partitions. In the 
case of the multiplicative theory, we provide specialization formulas and results on 
the analytic continuations of these “partition zeta functions,” find unusual formulas 
for the Riemann zeta function, prove identities for multiple zeta values, and see that 
some of the formulas allow for p-adic interpolation. The second family we study 
was anticipated by Manin and makes use of modular forms, functions which are 
intimately related to integer partitions by universal polynomial recurrence relations. 
We survey recent work on these zeta polynomials, including the proof of their 
Riemann Hypothesis. 


1 The Setting: Visions of Euler 


In antiquity, storytellers began their narratives by invoking the muse whose divine 
influence would guide the unfolding imagery. It is fitting, then, that we begin 
this article by praising the immense curiosity of Euler, whose imagination ranged 
playfully across almost the entire landscape of modern mathematical thought. 

Euler made spectacular use of product-sum relations, often arrived at by 
unexpected avenues, thereby inventing one of the principle archetypes of modern 
number theory. Among his many profound identities is the product formula for what 
is now called the Riemann zeta function: 


C(s):= Yin =[][a-p ty! (1) 


n=1 peP 
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With this relation, Euler connected the (at the time) cutting-edge theory of infinite 
series to the ancient set P of prime numbers. Moreover, in solving the famous “Basel 
problem” posed a century earlier by Pietro Mengoli (1644), Euler showed us how to 
compute even powers of z using the zeta function, giving explicit formulas of the 
shape 


€(2N) = 2°" x rational. (2) 


It turns out there are other classes of zeta functions, arising from other Eulerian 
formulas in the universe of partition theory. 

Much like the set of positive integers, but perhaps even more richly, the set 
of integer partitions ripples with striking patterns and beautiful number-theoretic 
phenomena. In fact, the positive integers N are embedded in the integer partitions 
# in a number of ways: obviously, positive integers themselves represent the set 
of partitions into one part; less trivially, the prime decompositions of integers are 
in bijective correspondence with the set of prime partitions, i.e., the partitions into 
prime parts (if we map the number | to the empty partition %), as Alladi and Erdés 
note [1]. We might also identify the divisors of n with the partitions of n into 
identical parts, and there are many other interesting ways to associate integers to 
the set of partitions. 

Euler found another profound product—sum identity, the generating function for 
the partition function p(n) 


co 


[[G-2)7 =dopme, (3) 


n=1 n=0 


single-handedly establishing the theory of integer partitions. This formula doesn’t 
look much like the zeta function identity (1); however, generalizing Euler’s proofs 
of these theorems leads to a new class of partition-theoretic zeta functions. It turns 
out that (1) and (3) both arise as specializations of a single “master” product-sum 
formula. 

Before we proceed, let us fix some notation. Let Y denote the set of all integer 
partitions. Let A = (Aj,A2,...,4,), with A; > Az > --- > A, = 1, denote a 
generic partition, /(A) := r denote its Jength (the number of parts), and |A| := 
Ay +A. +---+A, denote its size (the number being partitioned). We write “A - n” 
to mean A is a partition of n, and “A; € 4” to indicate A; € N is one of the parts of 
A. 

Then we have the following “master” identity (which is a piece of Theorem 1.1 
in [41]). 


Proposition 1 For an arbitrary function f : N > C, we have 


[ [a -fe@a’y! = SO a" T] £0. 


n=l NEP EA 
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The sum on the right is taken over all partitions, and the left-hand product is 
taken over the parts A; of partition A. The proof proceeds along similar lines to 
Euler’s proof of (3), and can be seen immediately if one expands a few terms of the 
infinite product by hand, without collecting coefficients in the usual manner. This 
simple identity also has interesting (and sometimes exotic) representations in terms 
of Eulerian g-series and continued fractions; readers are referred to [41] for details. 


Remark I Note that, collecting like terms, we can rewrite such partition sums as 
standard power series, summed over non-negative integers: 


Va@= Sg yoea (4) 


‘CP n=0 AEn 


It is immediate from Proposition | and (4) that the partition generating function 
(3) results if we let f = 1 identically. 

Similarly, if we set f(n) = n™*,q = 1, and sum instead over the subset Yp of 
prime partitions, we arrive at the Euler product formula (1) for the zeta function. 
This follows from the bijective correspondence between prime partitions and the 
factorizations of natural numbers noted above. 

In light of these observations, we might view the Riemann zeta function as the 
prototype for a new class of combinatorial objects arising from Eulerian methods. 


1.1 Partition-Theoretic Zeta Functions 


Inspired by work of Euler [19], Fine [23], Andrews [2], Bloch and Okounkov [7], 
Zagier [53], Alladi and Erdés [1], and others, the authors here undertake the study 
of a class of zeta functions introduced by the third author in [41], resembling the 
Riemann zeta function €(s) but summed over proper subsets of Y, as opposed to 
over natural numbers. 

In this paper, we review a few of the results from [41], and record a number of 
further identities relating certain zeta functions arising from the theory of partitions 
to various objects in number theory such as Riemann zeta values, multiple zeta 
values, and infinite product formulas. Some of these formulas are related to results 
in the literature; they are presented here as examples of this new class of partition- 
theoretic zeta functions. We also give several formulas for the Riemann zeta 
function, and results on the analytic continuation (or non-existence thereof) of zeta- 
type series formed in this way. Furthermore, we discuss the p-adic interpolation of 
these zeta functions in analogy with the classical work of Kubota and Leopoldt on 
p-adic continuation of the Riemann zeta function [33]. 

To describe our primary object of study, we must introduce a new statistic related 
to partitions, to complement the length and size defined above. We define the integer 
of a partition A, notated as nj, to be the product of its parts: 


nh i= AyA2... A, (5) 
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This multiplicative statistic may not seem very natural as partitions arise purely 
additively, with deep additive structures such as Ramanujan congruences dominat- 
ing the theory. Yet if we let n, formally replace the usual index n in the Riemann 
zeta function, and sum instead over appropriate subsets of partitions, we arrive at an 
analytic-combinatorial object with many nice properties. 


Definition 1 Over a subset #’ C # and value s € C for which the sum converges, 
we define a partition zeta function to be the series 


Very nice relations arise from unique properties of special subsets #’, e.g., 
partitions with some distinguishing structure, or into parts sharing some trait. For 
example, if we let Y* C FY denote partitions into distinct parts, there are interesting 
closed-form expressions, such as 


sinh 7 


Sg+(2) = ; (6) 
1s 


which follows from Proposition | together with a formula of Euler (cf. [41]). By 
summing instead over partitions into parts > 2 (that is, disallowing 1s), we arrive at 
curious identities such as 


3a 


Cx, (3) = cosh (2/3) 


(7) 


which can be found from Proposition | together with a formula of Ramanujan [41]. 
This equation is somewhat surprising, as the Riemann zeta values at odd arguments 
are famously enigmatic. 

Other attractive closed sums can be found—and general structures observed, as 
detailed in [41]—when we restrict our attention to partitions Y wv whose parts all 
lie in a subset .W@ CN. It is easy to check from Proposition | that we have the Euler 
product formula 


ta,@=[[a-e)". (8) 


kEeM 


We see from the right-hand side of (8) that ¢_/(s) diverges if 1 € .@, thus 
the restriction 1 ¢ .@ exhibited in the pair of identities above is a necessary one 
here. In fact, some restriction on the maximum multiplicity of 1 occurring as a part 
is necessary for any partition zeta function to converge. That is, we must sum over 
partitions containing | with multiplicity at most m > 0. Note that the resulting zeta 
function will equal the one in the case where Is are not allowed, multiplied by m+ 1, 
as each n,* is repeated m + | times in the sum (adjoining 1s to a partition does not 
affect its “integer’’). 
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We note that if Dirichlet series coefficients a, are defined by 


bag (8) =: Yo ann, 


n>1 


it is easy to see that a, counts the number of ways to write n as a product of integers 
in .@, where each ordering of factors is only counted once. When .@ = N, then 
these ways of writing n as a product of smaller numbers are known as multiplicative 
partitions, and have been studied in a number of places in the literature; for example, 
the interested reader is referred to [2, 13, 34, 43, 54]. 

We wish to study partition zeta functions over special subsets of YW and 
arguments s that lead to interesting relations. We begin by highlighting a few 
nice-looking examples. Let us recall a few identities from [41] as examples of 
zeta function phenomena induced by suitable partition subsets of the form Ay. 
By summing over partitions into even parts we have a combinatorial formula to 
compute 7: 


CP (2) = > (9) 


Curiously, this partition zeta function produces an odd power of zr, which does 
not occur in Euler’s evaluations of ¢ at even arguments. This simple formula belongs 
to an infinite family of identities arrived at by letting z = 2/m in Euler’s product 
identity for the sine function 


CO we) 
is Zz 
sinz=z] | (1-45). (10) 


n=1 


To describe this family, we take A,n G PF to denote the set of partitions into 


- 


multiples of m > 1. Summing over these partitions, we have identities such as 


b Pyx (2) = = (11) 
m sin (2) 
2 
Cy (4) = - (12) 


m? sin (2) sinh (2) ’ 
and similar (but increasingly complicated) formulas can be computed for ¢g,,,(2') 
with t € N. Clearly (9) is the case m = 2 of the first identity. 

All of these partition zeta formulas seem to hint at analogous algebraicity results 
like that of (2) for the Riemann zeta function. Although the first few examples 
show that the situation is more complicated, such formulas immediately lead one 
to suspect the existence of a similar theory. 

There is indeed a class of partition zeta functions with a structure much like that 
exhibited by classical zeta functions. 
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Definition 2 We define the partition zeta function € y({s}*) to be the following sum 
over all partitions A of fixed length (A) = k > 0: 


tol) = Re) > 1) 


CAj=k 4 


We directly find that Ca({st°) = ng = | and Ca({s}!) = f(s). Proceeding 
much as Euler did to evaluate €(2k) (cf. [19]), for k > 0 we find that €({2}*) is a 
rational multiple of ¢(2k): 


Q?k— 


to(ah) = 5H =e tO) (13) 


It follows that ¢ ({2}*) is of the form “z?*x rational” for all positive k, much like 
the zeta values €(2k) given by Euler. Equation (13) also suggests the well-known 
Riemann zeta value 


=) 


£(0) = = ta (h28") = 1/2, 


which is usually arrived at via analytic continuation. Furthermore, we can write 
6 »({2'*) explicitly for all t € N as finite combinations of well-known zeta values, 
also of the shape “zr m?* x rational” [41]. 

Of course, it was Riemann’s creative analytic continuation of the zeta function 
that led to stunning advances in number theory and analysis. Using general structural 
relations for partition zeta functions, the analytic continuation of ¢({s}*) is given 
in [41] for one case: for fixed length k = 2, we can write 


2 
Co({s}’) = eee (14) 


Thus ¢({s}*) inherits analytic continuation from the two zeta functions on the 
right. One would very much like to see further examples of analytic continuations 
of €({s}*) as well as other partition zeta functions. 

We remark that (14) resembles the well-known “series shuffle product” relation 
for multiple zeta values (see [4]). Hoffman gives lovely formulas using different 
notations, relating ¢({s}*) to combinations of multiple zeta values [27, 28]. As we 
will see below, the theory of partition zeta functions connects with multiple zeta 
values in other ways as well. 
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1.2 Evaluations 


In the previous section we saw a variety of simple closed forms for partition zeta 
functions, depending on the natures of the subsets of partitions being summed over. 
Different subsets induce different zeta phenomena. In what follows, we consider 
the evaluations of a small sampling of possible partition zeta functions having 
particularly pleasing formulas. 


1.2.1 Zeta Functions for Partitions with Parts Restricted by Congruence 
Conditions 


Our first line of study will concern sets .# C N that are defined by congruence 
conditions. Note by (8) that for disjoint sets 4, and“, 


CP gia, (s) = SPa (bP, (s). 


Hence, to study any set of partitions determined by congruence conditions on the 
parts, it suffices to consider series of the form 


CDi 4 my (S)s 


where a € Zso, m € N (excluding the case a = 0, m = 1, where the zeta function 
clearly diverges), and Yj+4 nn is partitions into parts congruent to a modulo m. 
We see examples of the case 6%, (2%) = €24,(2%) in (11) and (12); we are 
interested in the most general case, with s = n € N. 

Our first main result is then the following, where I" is the usual gamma function 
of Euler and e(x) := e?””. The proof will use an elegant and useful formula 
highlighted by Chamberland and Straub in [12], which we note was also inspired 
by previous work on multiplicative partitions in [13]. In fact, the following result 
is a generalization of Theorem 8 of [13] which in our notation corresponds to 
a=m=l1. 


Theorem 1 Forn > 2, we have 


n—-1 


C Pom (n) = TL +a/my "TTP (1 + ma) 


r=0 


Theorem | has several applications. By considering the expansion of the logarithm 
of the gamma function, we easily obtain the following result, in which y is the 
Euler—Mascheroni constant and the principal branch of the logarithm is taken. 
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Corollary 1 For any m,n > 2, we have that 


log (E+. mts (n)) = = nlog(1 + a/m) + ——— ane = Dia -y)- Ye (i+ + ad 


= EDGE = 1) (a’ + (a= e(r/m))*) 
3 


kmk 


r=0 k>2 

When a = 0 and m => 2, we obtain the following strikingly simple formula, 
which is similar to Theorem 7 of [13] that in our notation corresponds to the case 
a=m=1, 


Corollary 2 For any m,n > 2, we have that 


5 (k) 


kak 


log (67, (n)) =n 2 ink 


1.2.2 Connections to Ordinary Riemann Zeta Values 


In addition to providing interesting formulas for values of more exotic partition- 
theoretic zeta functions, the above results also lead to curious formulas for the 
classical Riemann zeta function. In fact, ¢(s) is itself a partition zeta function, 
summed over prime partitions, so it is perhaps not too surprising to find that we can 
learn something about it from a partition-theoretic perspective. Then we continue 
the theme of evaluations by recording a few results expressing the value of ¢ at 
integer argument n > | in terms of gamma factors. 

In the first, curious identity, let . denote the classical Mobius function. We point 
out that this is essentially a generalization of a formula for the case a = m = 1 
given in Eq. (11) of [13]. 


Corollary 3 For all m,n > 2, we have 
nk—-1 r 
[L(k) e(4) 
my" lo = Nn . 
E(n) = aes 2 e(r (: a. 


The next identity gives ¢(n) in terms of the nth derivative of a product of gamma 
functions. The authors were not able to find this formula in the literature; however, 
given the well-known connections between I" and ¢, as well as the known example 
below the following theorem, it is possible that the identity is known. 


Theorem 2 For integers n > 1, we have 


n n-l 


d 
Le lin Teun): 
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Example I As an example of implementing the above identity, take n = 2; then 
using Euler’s well-known product formula for the sine function, it is easy to 
check that 


2 2 


(Q) =~ ji trad ae ee ES es 
= — lim — -zZ= im =—. 
2 sot dz : 7 9 ot dz sin(nz) 6 


This last formula for €(n), following from a formula in [41] together with the 
preceding theorem, is analogous to some extent to the classical identity sin(n) = 


Corollary 4 For integers n > 1, we have 


T]jzo Fr (1 = ze(i/n)) — Tj29 F 1 = zei/n))! 
2z7 , 


&(n) = pe 


1.2.3 Zeta Functions for Partitions of Fixed Length 


We now consider zeta sums of the shape ¢({s}*) as in Definition 2. Our first aim 
will be to extend (13) above (which is Corollary 2.4 of [41]). As we shall see below, 
these zeta values are special cases of sums considered by Hoffman in [27]. 

Let [z"]f represent the coefficient of z” in a power series f. Using this notation, 
we show the following, which in particular gives an algorithmic way to compute 
each ¢ y({m}*) in terms of Riemann zeta values for m € N5». 


Theorem 3 For all m > 2, k € N, we have 


m—1 


Colts) = x™(2"* T] (1 <e(r/m)) 


r=0 


_ Fa Fad exp a (=)” 


jz 
Generalizing the comments just below (13), the next corollary follows directly 
from Theorem 3 (using the well-known fact that £(k) € Qz* for even integers k). 


Corollary 5 For m € 2N even, we have that 
Ex ({m}) € Qa™. 


Remark 2. This can also be deduced from Theorem 2.1 of [27]. 

To conclude this section, we note one explicit method for computing the values 
¢({m\) at integral k,m (especially if m is even, in which case the zeta values 
below are completely elementary). 
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Corollary 6 Form > 2,k €N, andj > i, set 


_ , (k — i)! 
Then we have 
O11 12 13... AK 


—1 a22 23 ... Mr, 
gl” 


k 
to({m*) = ae 0 -l 33... 3% 


0 0 wo. —1] Oke 


Remark 3 There are results resembling these in Knopfmacher and Mays [31]. 


1.3 Analytic Continuation and p-Adic Continuity 


If we jump forward about 100 years from the pathbreaking work of Euler concerning 
special values of the Riemann zeta function at even integers, we arrive at the 
famous work of Riemann in connection with prime number theory. Namely, in 1859, 
Riemann brilliantly described the most significant properties of €(s) following that 
of an Euler product: the analytic continuation and functional equation for C(s). It 
is for this reason, of course, that the zeta function is named after Riemann, and not 
Euler, who had studied this function in some detail, and even conjectured a related 
functional equation. In particular, this analytic continuation allowed Riemann to 
bring the zeta function, and indeed the relatively new field of complex analysis, to 
the forefront of number theory by connecting its roots to the distribution of prime 
numbers. 

It is natural therefore, whenever one is faced with new zeta functions, to ask 
about their prospect for analytic continuation. Here, we offer a brief study of some 
of these properties, in particular showing that the situation for our zeta functions 
is much more singular. Partition-theoretic zeta functions in fact naturally give 
rise to functions with essential singularities. Here, we use Corollary 2 to study 
the continuation properties of partition zeta functions over partitions Yj. into 
multiples of m > 1. In order to state the result we first define, for any ¢ > 0, the right 
half-plane H, := {z € C : Re(z) > e}, and we denote by 4 the set {1/n:n € N}. 


Corollary 7 For any ¢ > 0 andm > 1, g,,(s) has a meromorphic extension to 
HH, with poles exactly at H, M i In particular, there is no analytic continuation 
beyond the right half-plane Re(s) > 0, as there would be an essential singularity at 
s=0. 
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Remark 4 For the function €g,(s), a related discussion of poles and analytic 
continuation was made by the user mohammad-83 in a mathoverflow.net question. 

Finally, we follow Kubota and Leopoldt [33], who showed ¢ could be modified 
slightly to obtain modified zeta functions for any prime p which extend ¢ to the 
space of p-adic integers Z,, to obtain further examples of p-adic zeta functions of 
this sort. These continuations are based on the original observations of Kubota and 
Leopoldt, and, in a rather pleasant manner, on the evaluation formulas discussed 
above. 

In particular, we will use Corollary 6 to p-adically interpolate modified versions 
of ¢({m}*) in the m-aspect. Given the connection discussed in Sect. 1.4 to multiple 
zeta values, these results should be compared with the literature on p-adic multiple 
zeta values (e.g., see [25]), although we note that our p-adic interpolation procedure 
seems to be more direct in the special case we consider. 

The continuation in the m-aspect of this function is also quite natural, as the case 
k = 1 is just that of the Riemann zeta function. Thus, it is natural to search for a 
suitable p-adic zeta function that specializes to the function of Kubota and Leopoldt 
when k = 1. It is also desirable to find a p-adic interpolation result which makes the 
partition-theoretic perspective clear. 

Here, we provide such an interpretation. Let us first denote the set of partitions 
with parts not divisible by p as Y,; then we consider the length-k partition zeta 
values Ca, ({s}*). Note that fork = 1, ¢ P, ({s}1) is just the Riemann zeta function 
with the Euler factor at p removed (as considered by Kubota and Leopoldt). We then 
offer the following p-adic interpolation result. 


Theorem 4 Let k > 1 be fixed, and let p = k + 3 be a prime. Then Ca, ({s*) can 
be extended to a continuous function for s € Zp which agrees with ¢ PD, ({s}*) ona 
positive proportion of integers. 


1.4 Connections to Multiple Zeta Values 


Our final application of the circle of ideas related to partition zeta functions and 
infinite products will be in the theory of multiple zeta values. 


Definition 3 We define for natural numbers m,,m2,...,m, with m > 2 the 
multiple zeta value (commonly written “MZV”’) 


1 
(my, m2, ..., Mx) = 2 mR * 
1 My 


ny>ny>>n>1 1 


We call k the length of the MZV. Furthermore, if m, = m2 = --- = m, are all equal 
to some m € N, we use the common notation 


1 
c({im}):= ye ——__,. (15) 
ny>n>>nm>1 (nn2...Nk) 
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Multiple zeta values have a rich history and enjoy widespread connections; 
interested readers are referred to Zagier’s short note [51], and for a more detailed 
treatment, the excellent lecture notes of Borwein and Zudilin [9]. There are many 
nice closed-form identities in the literature; for example, one can show (see [27]) on 
analogy to (13) that 


2k 


ys 
f({2}") = Oka!" 


(16) 


Similar (but more complicated) expressions for ¢({2'}") for all t € N are given in 
[41], parallel to those mentioned in Sect. 1.1 for 6a ({2'!*). 

Observe that the partition zeta function ¢({m}*) (see Definition 13) can be 
rewritten in a similar-looking form to (15) above: 


tCaltm y= oO — (17) 


(nynz...k)” 
ny >ng>>n>1 ici 


In fact, if we take Y* to denote partitions into distinct parts, then (15) reveals 
£({m}*) is equal to the partition zeta function C+ ({m}*") summed over length-k 
partitions into distinct parts, as pointed out in [41]. Series such as those in (17) have 
been considered and studied extensively by Hoffman (for instance, see [27]). 

By reorganizing sums of the shape (17), we arrive at interesting relations between 
¢@({m}") and families of MZVs. In order to describe these relations, we first recall 
that a composition is simply a finite tuple of natural numbers, and we call the sum 
of these integers the size of the composition. Denote the set of all compositions by 
@ and write |A| = k for A = (a,,a2,...,a)) € @ ifk = aj + ay +--+ + aj. Then 
we obtain the following. 


Proposition 2 Assuming the notation above, we have that 


Ca(im})= SY) E(aym, ay, ... ,ajm). 


Remark 5 Proposition 2 is analogous to results of Hoffman; the reader is referred 
to Theorem 2.1 of [27]. 

In particular, for any n > 1 we can find the following reduction of £({n}*) to 
MZVs of smaller length. We note in passing that Theorem 2.1 of [27] also shows 
directly how to write these values in terms of products (as opposed to simply 
linear combinations) of ordinary Riemann zeta values: hints, perhaps, of further 
connections. We remark in passing that this can be thought of as a sort of “parity 
result” (cf. [29, 45]). 


Corollary 8 For any n,k > 1, the MZV €({n\") of length k can be written as an 
explicit linear combination of MZVs of lengths less than k. 
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As our final result, we give a simple formula for ¢({n}"). This formula is probably 
already known; if k = 2 it follows from a well-known result of Euler (see the 
discussion of H(n) on p. 3 of [52]) and is closely related to (11) and (32) of 
[8]. The idea of the proof is also similar to what has appeared in, for example, 
[52]. However, the authors have decided to include it due to connections with the 
ideas used throughout this paper, and the simple deduction of the formula from 
expressions necessary for the proofs of the results described above. 


Proposition 3. The MZV £({n}*) of length k can be expressed as a linear combina- 
tion of products of ordinary € values. In particular, we have 


s¢tni8) = 1) [2] exp ( ~ > Pav 


jz 


Remark 6 This formula is equivalent to a special case of Theorem 2.1 of [27]. 
However, since the proof is very simple and ties in with the other ideas in this paper, 
we give a proof for the reader’s convenience. 

The proof of Corollary 6 yields a similar determinant formula here. 


Corollary 9 Forn > 2,k € N, andj > i, set 


(k—i)! 
(k-pr 


Big = —F(nG-it 1)) 
Then we have 


Bia Biz Biz --- Bie 
(—1) —1 Bo» Bo3 ... Box 
E({n}) = yet 0 =] Ba vn2 Bae 


0 0 ... -1 Bee 


Remark 7 We can see from the above corollary that ¢({n}*) is a linear combination 
of products of zeta values, which is closely related to formulas of Hoffman [27]. 


1.5 Machinery 
1.5.1 Useful Formulas 


In this section, we collect several formulas that will be key to the proofs of 
the theorems above. We begin with the following beautiful formula given by 
Chamberland and Straub in Theorem 1.1 of [12]. In fact, this formula has a long 
history, going back at least to Sect. 12.13 of [47], and we note that Ding, Feng, and 
Liu independently discovered this same result in Lemma 7 of [17]. 
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Theorem 5 [fn € Nand a,...,@, and Bi,..., By, are complex numbers, none of 
which are non-positive integers, with )°_ oj = 0, Bj, then we have 


“(k+aj) _ y7I(Bj) 
Wer) =U rey 


k>0 j=l j=l 


We will also require two Taylor series expansions for log J”, both of which 
follow easily from Euler’s product definition of the gamma function [20]. The first 
expansion, known as Legendre’s series, is valid for |z| < 1 (see (17) of [48]): 


k 
log F(1 +2) =—yz+ ya (18) 
k>2 
We also have the following expansion valid for |z| < 2 (see (5.7.3) of [36]): 

k 

k z 
log P(1 +2) = —log(1 +2) +20 —y) + DOEDE(D) i) (19) 

k>2 


Furthermore, we need a couple of facts about Bell polynomials (see Chap. 12.3 
of [2]). The nth complete Bell polynomial is the sum 


n 
Bi, see Xn) = D> Brie, 2, see Xn—it1)- 


i=1 


The ith term here is the polynomial 


i n! xX] di x2 p Xn—-i+1 Jn-i+1 
By j(X1,%2,.-- Fe=) mis cal (=) (5) Sas (a) 


where we sum over all sequences j,, j2,... ,jn—i+1 Of non-negative integers such that 
Jit fate bf = and jj + 2fo + 3jg Fe + 0+ Yi = 2. 

With these notations, we use a specialization of the classical Faa di Bruno 
formula [22], which allows us to write the exponential of a formal power series 
as a power series with coefficients related to complete Bell polynomials: 


ae) Bla ax) 
ij KL], ..+5 Ak 
exp y aH — y eg oe (20) 
j=l" k=0 , 


Faa di Bruno also gives an identity [22] that specializes to the following formula 
for the kth complete Bell polynomial in the series above as the determinant of a 
certain k x k matrix: 
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ay ("ao (F'as (aa ak 


—1 a CG Vag CG Yaz ae ae ESA 
0 —1 a Ce Nay Hi eee RSS 
By(ay,...,4) = det] 0 0 -1 iy | ee ee ape (21) 
0 0 0 -1 wee ee Ak—4 
0 0 0 00. Al wg 


1.5.2 Proofs of Theorems 1 and 2, and Their Corollaries 


We begin with the proof of our first main formula. 


Proof (Proof of Theorem 1) By (8), we find that 


n—1 


(a + mj)" G+1+.a/m)" 
F Pat mn”) a aa = Gaye (a +m i)” = i ‘ a—e(r/n) , 
teatmn © uy j=0 r=0 (i +1+ es ) 
Using Theorem 5 and the well-known fact that 
n—1 
do eG/n) = 0 (22) 
j=0 


directly gives the desired result. 


Proof (Proof of Corollary 1) For this, we apply (19) and use (22), the obvious 
fact that 


|(a— eGj/n))/m| < 2, 
and the easily checked fact that 
1+ (a—eG/n)) 


is never a negative real number for j = 0,...,n— 1. 


Proof (Proof of Corollary 2) Here, we simply use (18). Again, the corollary is 
proved following a short, elementary computation, using the classical fact that 


n-1 
— jn ifnlk, 
d e(rk/n) = f else. 


Proof (Proof of Corollary 7) By Corollary 2, we find for n > 2 that 


§ (nk) 


kmkn * 


log (6A, :(n)) = >> 


k>1 
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Suppose that Re(s) > 0 ands ¢ i. Then letting 
K := max{[1/ Re(s)] + 1, Re(s)}, 


it clearly suffices to show that 


§ (sk) 
2S kmks 
k>K 


converges. But in this range on k, by choice we have Re(sk) > 1, so that using the 
assumption m > 2, we find for Re(s) > 0 the upper bound 


t (sk) I 1 
2 kmks s ¢(Ks) 2B kk Re(s) s ¢(Ks) = k2k Re(s) 
k>K k>1 


k>K 


= —£(Ks) log (2~ 8 (28°) _ 1)), 


and note that in the argument of the logarithm in the last step, by assumption we 
have 22°) _ 1 > 0, 

Conversely, if s € i: then it is clear that this representation shows there is a pole 
of the extended partition zeta function, as one of the terms gives a multiple of ¢(1). 


Proof (Proof of Corollary 3) We utilize a variant of Mébius inversion, reversing the 
order of summation in the double sum > 7)51 oa, H(@f (nk)k*: if 


then 


f(n) = > pee) 


k 
k>1 


Applying this inversion procedure to Corollary 2, so that g(n) = logéa,,(n) 
(taking s = 1), and f(n) = ¢(n)/m", we directly find that 


k 
Gin) = ml A tog Cyn). 
k>1 


Applying Theorem | then gives the result. 


Proof (Proof of Theorem 2) By the comments following Theorem 1.1 in [41], for 
M € N we have 


«=i 
1-2) ee : ; (23) 
i ( e x ks | |jea (1 = =) 

isk 
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thus 


Taking WZ = N,s =n € Zs», we apply LHospital’s rule n times to evaluate the 
limit on the right-hand side. The theorem then follows by noting, from Theorem 5, 
that in fact 


n\-—l n—1 
M(1-=) =[[ rd -<zeGi/n)). 
j=0 


Proof (Proof of Corollary 4) Picking up from the proof of Theorem 2 above, it 
follows also from Theorem 1.1 of [41] that 


Subtracting this equation from (23), making the substitutions #@ = N,s =n > 2 
as in the proof above, and using Theorem 5, gives the corollary. 


1.5.3. Proof of Theorem 3 and Its Corollaries 
Proof (Proof of Theorem 3) Using a similar method as in [41] and a similar rewriting 


to that used in the proof of Theorem 1, we note that a short elementary computation 
shows 


k (kK + 1)” 
>, 2 tttm) = [J am -IIM; (k + 1— 2e(r/m))’ 


k>0 k>1 am k>0 r=0 


Much as in the proof of Theorem 2, using Theorem 5, we directly find that this is 
equal to 


m—1 


I] rT (1 _ ~e(r/m)) ; 


r=0 


which gives the first equality in the theorem. Applying Eq. (18) (formally we require 
|z| < z, but we are only interested in formal power series here anyway), we find 
immediately, using a very similar calculation to that in the proof of Corollary 2, that 
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m—| 


(2) sect) = exp [2 (2)" ecriim 


k>0 r=0 j=2 


(24) 


= exp my B (2) 


which is equivalent to the second equality in the theorem. 

Proof (Proof of Corollary 6) Replace x with z” in Eq. (20), and set 
_ G-V'e(mj) 

= rim 


on the left-hand side [which becomes the right-hand side of (24)]. Then comparing 
the right side of (20) to the left side of (24), we deduce that 


mk 
tol{my') = Bulan. ~congl 


To complete the proof, we substitute the determinant in (21) for B,(a,,...,a,) and 
rewrite the terms in the upper half of the resulting matrix as a,j, as defined in the 
statement of the corollary. 


Proof (Proof of Theorem 4) In analogy with the calculation of Theorem 3, we find 
that 


1 

yoo a, my) = [] [= 
k>0 kal kM 

ptk 

(k-+1)" m—1 
Teo ne — — rT a = ze(r/m)) 
m—1 1)” ~~ Z . 
> Tees 0 Gertcsenim) rao (I~ Ze(r/m)) 


As in the calculation of (24), this is equal to 


wo SEM oma 


jz1 
so if we set 


(k—i)! 


Pm) = Cm i+ DE 
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where 


Ss) = dp *)E(9), 


then we have 


(Pp) (P) (vp) (p) 
Ory “12 “1.3 ibe eh 
p 

—1 as, a5 Oy k 


1 > ’ 
Cx, ({m}") = pet Oi as g55 O34 


We further define ¢ , ({m}*) for more general values in C, such as m € —N using the 


analytic continuation of ¢ in each of the factors on (m). Next we recall the Kummer 
congruences, which state that if k,;,k are positive even integers not divisible by 
(p — 1) and k, = ky (mod p**! — p*) for a € N where p > 2 is prime, then 


(1 — p'"') = (1 — p) ae (mod per): 


Let us take S,, to be the set of natural numbers congruent to sy) modulo p — 1. The 
Kummer congruences then imply that for any so # O (mod p — 1), and for any 
ky, ko € Ss) with kj = ky (mod p*) and ky, kp > 1, that 
o*(1— ki) = o*(1— ky) (mod p™*"'). 

If we choose m1, m2 € Ss. with m; = m (mod p*), then the values 1—(1—m,)(G— 
i+ 1), 1—(.—m)G—i+ 1) are in S$) 4(5)~-1)G—i-1) and are congruent modulo p’, 
and as p > k the additional factorial terms (inside and outside the determinant) are 
p-integral. Now in our determinant, j — i + 1 ranges through {1,2,...,k}, and we 
want to find an sg such that 1 + (so —1)r € 0 (mod p—1) forr € {1,2,...k}. If we 
take sy) = 2, then the largest value of 1 + (sg — 1)ris k + 1, which is by assumption 
less than p — 1, and hence not divisible by it. Hence, in our case, s9 = 2 suffices. 
Thus, if m,,m € Sy with m,; = my (mod p*), then 


fa, (tl a m}‘) = fa, (Ql — my") (mod per’), 


This shows that our zeta function is uniformly continuous on Sz in the p-adic 
topology. As this set is dense in Z,, we have shown that the function extends in 
the m-aspect to Zp. 
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1.5.4 Proofs of Results Concerning Multiple Zeta Values 


Proof (Proof of Proposition 2) Recall from (17) that we need to study the sum 


a 
(nn ears ny)” ; 
nm 2ng>—>nz1 


The proof is essentially combinatorial accounting, keeping track of the number of 
ways to split up a sum 


s 


ny >=ng>>ne=1 


over all k-tuples of natural numbers into a chain of equalities and strict inequalities. 
Suppose that we have 


My > 2 >> 1. 


Then if any of these inequalities is an equality, say nj = n +1, in the contribution to 
the sum 


Y Gs 


n=n>—>nzl 


the terms n; and nj+; “double up.” That is, we can delete the nj; and replace the 
n,”" in the sum with an a Thus, the reader will find that our goal is to keep track 
of different orderings of > and =, taking symmetries into account. The possible 
chains of = and > are encoded by the set of compositions of size k, by associating 
to the composition (a;,...,a;) the chain of inequalities 


Ayr = Nay > Na, +1 St = Nayt+az > Nay+1 >t > Nk. 


That is, the number a, determines the number of initial terms on the right which 
are equal before the first inequality, a. counts the number of equalities in the next 
block of inequalities, and so on. It is clear that the sum corresponding to the each 
composition then contributes the desired amount to the partition zeta value in the 
corollary. 


Proof (Proof of Corollary 8) In Proposition 2, comparison with Corollary 6 
shows that we have a linear relation among MZVs and products of zeta values. 
Observe that in £({m}*), the only composition of length k is (1, 1,..., 1), which 
contributes k!¢({m}*) to the right-hand side of Proposition 2, and that the rest of 
the compositions are of lower length, hence giving MZVs of smaller length; the 
corollary follows immediately. 
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Proof (Proof of Proposition 3) Consider the multiple zeta value ¢({n}*) of length k. 
Then we directly compute 


eyed = TT G-(4))=T] ees _— nm) 


k>0 m>1 m>0 r=0 


By Theorem 5, this equals 


n—-1 


I] P(1—ze(r/n)) 1. 


r=0 


Using precisely the same computation as was made in the proof of Theorem 3, we 
find that this is equal to 


exp | —n aoe oO) 
j22 
nj 
Hence, we have that 
E({n}5) = (-1)* [2] ye — zi 


jz 


Proof (Proof of Corollary 9) Here we proceed exactly as in the proof of Corollary 6, 
except we make the simpler substitution 


= (k— 1)! (nk) 


into Eq. (20), and compare with Proposition 3. In the final step, we replace the terms 
in the upper half of the matrix with 6;; as defined in the statement of the corollary. 


1.6 Some Further Thoughts 


We have presented samples of a few varieties of flora one finds at the fertile inter- 
section of combinatorics and analysis. What unifies all of these is the perspective 
that they represent instances of partition zeta functions, with proofs that fit naturally 
into the Eulerian theory we propound. 
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We close this article by noting a general class of partition-theoretic analogs of 
classical Dirichlet series having the form 


Qo (fs) = y fAn, 


Aco’ 


where & is a proper subset of Y and f : Y’ — C. Of course, partition zeta 
functions arise from the specialization f = 1, just as in the classical case. 

Taking Y’ = Py as defined previously, then if f := f (nj) is completely 
multiplicative with appropriate growth conditions, it follows from Theorem 1.1 of 
[41] that we have a generalization of (8) 


. —1 
Dp af.s) = T] (: -) (Re(s) > 1), (25) 


JEM 


and nearly the entire theory of partition zeta functions noted here (and developed in 
[41]) extends to these series as well. 


2 Zeta Polynomials 


We now turn our attention toward a different layer of connections between partitions 
and zeta functions, via the theory of modular forms. Although Euler’s generating 
function for p(n) is essentially modular, and Euler also anticipated the study of 
L-functions that are intimately tied to modular forms, the true depth of such 
observations did not come into view until further work on complex analysis was 
carried out in the nineteenth century. 

It turns out that all modular forms are related to partitions in a very direct 
way. Here we recall the case of this connection for modular forms for the full 
modular group SL2(Z). We then use these modular forms to define the second class 
of functions that are the topic of this paper, the zeta polynomials associated with 
modular forms. 


2.1 Partitions and Modular Forms 


In a paper from 2004 [11], the first author, Bruinier, and Kohnen investigated the 
values of a certain sequence of modular functions in connection with the arithmetic 
properties of meromorphic modular forms on SL,(Z). One of the main results in the 
paper shows that integer partitions and a universal sequence of polynomials encode 
the Fourier expansions of modular forms. 
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Here we recall this result, which can be thought of as a precise formulation of 
the assertion that a modular form is distinguished by its “first few coefficients.” In 
fact, by making use of partitions we have an effective recursive procedure which 
computes the Fourier coefficients in order. 

We take q := e?” throughout this section. Now, suppose that 


f@) = Yo ap(n)q" 
n=h 


is a weight k € 2Z meromorphic modular form on SL2(Z). If k > 2 is even, then let 
E;(z) denote the normalized Eisenstein series 


2k = 
Ex(@) = 1— FY onan)d. (26) 
n=1 


Here B;, denotes the usual kth Bernoulli number and ox—1(n) := >> d\n d Tfk > 2, 
then £;(z) is a weight k modular form on SL2(Z); although the Eisenstein series 


Ep(z) = 1-24) ° ai (n)q" (27) 
n=1 


is not a modular form, it also plays an important role. As usual, let j(z) denote 
the modular function on SL2(Z) which is holomorphic on H, the upper half of the 
complex plane, with Fourier expansion 


j(2) = q7! +. 744 + 196884q + 21493760q7 +--+. 


We will require a specific sequence of modular functions j,,(z); to define this 
sequence, we set 


Jo(Z := 1 and =, (z) := j(z) — 744. (28) 
If m > 2, then define j,,(z) by 
Jm(2) ?= ji(z) | To(m), (29) 


where 7y(m) is the usual normalized mth weight zero Hecke operator. Each j,,(z) is 
a monic polynomial in j(z) of degree m. Here we list the first few: 


Jjo(z) = 1, 
ji) = j(2) — 744 = q7! + 196884g +--+, 
jal) = jz)? — 1488j(z) + 159768 = gq? + 42987520g +--+, 


3) = j(2* — 2232j(z)? + 1069956j(z) — 36866976 = gq? + 2592899910g +++. 
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Let ¥ denote the usual fundamental domain of the action of SL2(Z) on H. By 
assumption, § does not include the cusp at oo. Throughout, let i := —1 and let 
w := (1+ V—3)/2. If t € §, then define e, by 


1/2 ift =i, 
ep = 41/3 ift =a, (30) 
1 otherwise. 


By studying the logarithmic derivatives of modular forms, the first author, 
Bruinier, and Kohnen obtained the next result in [11], which offers universal 
polynomial recursion formulas for the Fourier coefficients of modular forms. 


Theorem 6 For every positive integer n define F,,(x1,...,%n) € Q(x, ..-,Xn] by 


2x10) (n) 
Fy (Qty 20254) == 
n 
oe — 1)! 
+ » (crymctetmes OF Mat = DE ym mt, 
my, |... Mp—! 
M1, ...,Mn—1=0. 


my 42mg t+ (n—Dnin— =n 


F{f@=q' + yah + n)q'*" is a weight k meromorphic modular form on 
SL2(Z), then for every positive integer n we have 


ar(h +n) = Fy,(k,ap(h + 1),...,ap(h +n —1)) — . Zee - j,(T). 


Remark 8 Theorem 6 illustrates a deep connection between integer partitions and 
the coefficients of modular forms. An inspection of the summation in the theorem 
reveals the role of partitions: the partitions of n, apart from (n) itself, determine the 
recurrence formula for ar(h + n). We note that for each such partition 


A= ((n— 1) (n= 2) 22.271) En 
with m, being the multiplicity of k as a part, we can rewrite m + m2 +--:++m,-1 = 


£(A). 


The first few polynomials F,, are 
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Fy (x1) = —2x,, 


Zi 
Fo(x1,X2) = —3x, + a 


8x, x 
F3(x1,X2,%3) = ——— — 4 + xx3, 
3 3 
oP ~~. 2 
Fg (1, X2,.%3,.X4) = =a — 3x3 + xox4 + m ahs ze 


The n = 1 case of Theorem 6 implies that 


a(h + 1) = 60k — 744h — ) “e,ord, (f) - f(t). 


TES 


Example I Since 


CO 


A() = Yo t(n)q" = q— 249? + 2529? ----, 


n=1 


the unique normalized weight 12 cusp form on SL2(Z) is nonvanishing in H, 
Theorem 6 implies that 


t(n+ 1) = F,(12, t(2),...,T(n)). 


2.2 Manin’s Zeta Polynomials 


Theorem 6 shows that integer partitions and universal polynomials play central 
roles in computing the Fourier coefficients of modular forms. The only additional 
data required is a form’s leading coefficient, weight, and its divisor. In the spirit of 
the previous section, it is then natural to ask for a theory of zeta functions related 
to partitions, which somehow arises from this connection. Here we address this 
problem by giving a brief exposition of recent work on a problem of Manin on zeta 
polynomials. 

To begin, we consider any newform f = >>, dng” € Sx(Zo(N)) of even weight 
k and level N. The standard zeta-type function associated with f is its L-function 


L(f,s):= 2 ang’; 


n>1 


which may be normalized so that the completed L-function 


A(f,s) = (2) rows 
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may be analytically continued and satisfies the functional equation 


A(f,s) = eA. k —s), 


with e(f) = +1. The critical L-values are the complex numbers L(f, 1), L(f, 2), 
..., L(f,k — 1). Manin’s conjecture then states that these critical L-values can be 
suitably packaged to fit into the following framework. 


Definition 4 (Manin) A polynomial Z(s) is a zeta polynomial if it satisfies the 
following criteria: 


. (Naturality) It is arithmetic-geometric in origin. 

. (Functional Equation) For s € C we have Z(s) = +Z(1—s). 
. (Riemann Hypothesis) If Z(o) = 0, then Re(p) = 1/2. 

. The values Z(—n) have a “nice” generating function 

. The values Z(—n) encode arithmetic-geometric information. 


AWN Re 


We remark that, of course, it is very important that the arithmetic-geometric 
nature of the function in part (1) of the definition is satisfied. For example, 
any polynomial with real coefficients and satisfying the Riemann Hypothesis 
automatically satisfies the functional equation in part (2), and hence the compelling 
arithmetic nature of the particular object being constructed is critical. Part (4) is also 
central to Manin’s idea, and is meant to parallel Euler’s power series expansion for 
the Riemann zeta function: 


t 1 7 ? 
pags = lt 5H pars n)-— G1) 
This generating function (31) for the values ¢(—n) also has a well-known interpre- 
tation in K-theory [24], in line with part (5) of the above definition. Namely, it is 
essentially the generation function for the torsion of the K-groups for Q. 

Recently in [37], Sprung and the first two authors confirmed Manin’s specula- 
tions and offered the following resolution to his question. To describe this result, 
first consider the mth weighted moments of critical values: 


i-2 ae 2 
aia, 16720, 1 ae 
m= (2) &-2=p)r  €-D! ( j Jags 


j=0 j=o 


(32) 
For positive integers n, we recall the usual generating function for the (signed) 
Stirling numbers of the first kind: 


n 


(X)n = x(x — 1)(x—2)---(@—n4+ 1) =: s(n, m)x” (33) 


m=0 
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Using these numbers, the zeta function satisfying Manin’s definition is given by 


k—2 k—2-h & ah 


Z;(s):= Yi(-s)" YO : 


h=0 m=0 


-s(k—2,m-+h)-My(m). (34) 


The main result of [37] confirms that this natural object satisfies a zeta-type 
functional equation and the Riemann Hypothesis. 


Theorem 7 (Theorem 1.1 of [37]) If f € S,(1o(N)) is an even weight kk > 4 
newform, then the following are true: 


1, For alls € C we have that Z;(s) = e(f)Z( — 8). 
2. If Z(p) = 0, then Re(p) = 1/2. 


In accordance with Definition 4, we also want to find natural interpretations 
of the values Z-(—n) at negative integers and for their generating functions. This 
may be accomplished by considering the “Rodriguez-Villegas Transform” of [40]. 
Theorem 7 is naturally related to the arithmetic of period polynomials! 


k—2 
woe (157) -agk—t- aed (35) 


j=0 


The values of Zr(s) at non-positive integers are then the coefficients expanded 
around z = 0 of the rational function 


Ry (z) 
(1 = zjel . 


The following result is the analogue of (31) for the Riemann zeta function, and is 
our answer to part (4) of the above definition. 


Theorem 8 (Theorem 1.3 of [37]) Assuming the notations and hypotheses above, 
as a power Series in z we have 


Re) 
(1 eral y = ae a 

In accordance with the aforementioned K-theoretic interpretation of the values 

of ¢(s) at negative integers, and in relation to part (5) of Definition 4, it is natural 

to ask whether the z-series in Theorem 8 has an analogous interpretation—that 

is, what (if any) arithmetic information is encoded by the values Z-(—n)? Hints 

along these lines were first offered by Manin in [35], where he produced similar 


'This is a slight renormalization of the period polynomials considered in references such as [15, 
32, 38, 50]. 
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zeta polynomials by applying the Rodriguez-Villegas transform to the odd period 
polynomials for Hecke eigenforms on SL2(Z) studied by Conrey et al. [16]. He 
asked for a generalization for the full period polynomials for such Hecke eigenforms 
in connection to recent work of El-Guindy and Raji [21]. Theorems 7 and 8 answer 
this question and provide the generalization for all even weight k > 4 newforms on 
congruence subgroups of the form J9(V). Theorem 7 additionally offers an explicit 
combinatorial description of the zeta polynomials in terms of weighted moments. In 
fact, the developments of [37] are made possible by the recent general proof of the 
following Riemann Hypothesis for period polynomials. 


Theorem 9 ((30]) For any even integer k > 4, and any level N, all of the zeros of 
the period polynomial Ry(z) are on the unit circle. 


Remark 9 The reason that the term “Riemann Hypothesis” is fitting for this result is 
that the period polynomials satisfy a natural functional equation relating the values 
at z and —1/z (itself arising from the functional equation for L(f, s)), and this result 
states that all zeros of the period polynomials lie on the line of symmetry for their 
functional equations. 


Remark 10 In fact, the authors of [30] prove a somewhat stronger statement, giving 
rather precise bounds on the location of the angles of these zeros. 

In line with part (5) of Definition 4, we offer a conjectural combinatorial 
arithmetic-geometric interpretation of the Z,(s). To this end, we make use of the 
Bloch—Kato Conjecture, which offers a Galois cohomological interpretation for 
critical values of motivic L-functions [6]. Here we consider the special case of the 
critical values L(f, 1), L(f,2),..., Lf, k— 1). These conjectures are concerned with 
motives .% associated with f, but the data needed for this conjecture can be found 
in the A-adic realization V, of .@ for a prime A of Q(f), where Q(f) is the field 
generated by the Hecke eigenvalues a,(f) (where we have a;(f) = 1). The Galois 
representation V) associated with f is due to Deligne, and we recall the essential 
properties below. For a high-brow construction of V, from -%, we refer to the 
seminal paper of Scholl [42]. 

Deligne’s theorem says that for a prime A of Og) lying above J, there is a 
continuous linear representation V, unramified outside IN 


pra : Gal(Q/Q) > GL(V,), 
so that for a prime p + IN, the arithmetic Frobenius Frob, satisfies 
Tr(or(Frob;, ')) = a,(f), and det(p,(Frob,')) = pe, 
We may also consider the jth Tate twist V,(j), which is V, but with the action of 


Frobenius multiplied by p’. After choosing a Gal(Q/Q)-stable lattice T, in V,, we 
may consider the short exact sequence 


0 — Ti(j) > Vil) “> Va/T1G) — 0. 
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Bloch and Kato define local conditions H;(Q,, V,(j)) for each prime p. We let 
H; (Q, V(/)) be the corresponding global object, i.e., the elements of H'(Q, Vi(j)) 
whose restriction at p lies in H;(Q,,V,(j)). Analogously, we may define 
Hy (QV /T}(j)), which is the Bloch—-Kato A- Selmer group. The Safarevié—Tate 
group is 


. A; (Q, Va/TiG)) 
Ip) = D a yy)” 
x Tx He (Q, Va@/)) 
The Bloch—Kato Tamagawa number conjecture then claims the following. 
Conjecture 1 (Bloch-Kato) Let 0 <j < k — 2, and assume L(f,j + 1) 4 0. Then 


we have 


LG J+) Tam(j + 1)#U1( + 1) — 
Onitiqroyt ~"'" gga nHHe(k—-1—) oe 
(27iy Q Q J 


Here, §2~ denotes the Deligne period, Tam denotes the product of the Tamagawa 
numbers, Ho is the set of global points, and u;+, is a non-specified unit of Q(f). 


Remark 11 Note that L(f,j + 1) 4 0 in this range provided that j + 1 4 k/2. 
We denote the normalized version of C(j + 1) by 


(iVNytlQoyt 
(kK-2—j)! 


CG+ 1 =CG+1)- (36) 


but when L(f,j + 1) = 0, we define CcG+l := 0. 


Theorem 10 (Theorem 1.4 of [37]) Assuming the Bloch-Kato Conjecture and the 
notation above, we have that 


Mpm) = S> CG+Di". 


0<j<k-2 


which in turn implies for each non-negative integer n that 


k—-2 [k—2 k-2-h Pree ees 
atm = 3 (¥ >. m( ; ) stam 4-09) CTR 1). 


j=0 \h=0 m=0 


Finally, we offer an intriguing connection of the zeta polynomials Z,(s) to the 
world of combinatorial geometry. This investigation is motivated by the following 
question. In light of the vast importance of the distribution of the zeros of the 
Riemann zeta function on (presumably) the critical line, it is natural to consider 
the distribution of the zeros of Z(s) on the line Re(s) = 1/2. Of course, the Z;(s) 
are polynomials, and so any interesting distributional properties only make sense 
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in the context of limits of sequences of modular forms as the level or weights go 
to infinity. In this sense, one can ask: do their zeros behave in a manner which is 
analogous to the zeros of the Riemann zeta function €(s)? Namely, how are their 
zeros distributed in comparison with the growth of 


N(t) :=#{p =st+it : €(p) =0 withO <t<T}, 


which is well known to satisfy 


T T T 
N(T) = — log — — — + O(log T)? (37) 
2n 2n 2 


As we shall see, the zeros of Z;(s) behave in a manner that is somewhat analogous 
to (37) in terms of its highest zero. 

To make this precise, it turns out that the correct answer lies in comparing Z;(s) 
with two families of combinatorial polynomials. In what follows, we note that for 
x,y € C, the binomial coefficient (*) is defined by 


y. 


x\ r(«+1) 
yy} POtDPre@-yt) 


We see below that the Z;(s), depending on e¢(f), can naturally be estimated by the 


polynomials 
s+tk—2 Ss 
Hi (s):= : 38 
x (S) ( -_ )+(.2,) (38) 


k-3 _: -=3 
Hy (s) ->( oe ) (39) 


Theorem 11 (Theorem 1.3 of [37]) Assuming the notations and hypotheses above, 
the following are true: 


1. The zeros of H, (—s) lie on the line Re(s) = 1/2, and they are the complex 
numbers p = 5 + it with t € R such that the value of the monotonically 
decreasing function 


— 2t 
h(t) = i 
c(O) x co ( ae :) 


lies in the set {,27,...,(k — 3)s}. Similarly, the zeros of Hii (—s) lie on the 
line Re(s) = 1/2 and have imaginary parts t which may be found by solving for 
hy(t) to lie in the set {1/2,3m/2,...,(K — 5/2)m}. Moreover, as k > on, the 
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highest pair of complex conjugate roots of H, (s) have imaginary part equal in 
absolute value to 


(K— 3)(K—1) 
20 


and the height of the highest roots of Hf (s) is 


+ O(), 


et cheat O(1). 
IU 


2. Let f € S4([0(N)) be a newform. If e(f) = —1, then the only root of Z,(s) is at 
s = 1/2. If e(f) = 1, then there are two roots of Z;(s), and as N > o, their 
roots converge on the sixth order roots of unity exp(+7i/3). 

3. For fixedk > 6, as N — +00, the zeros of Z(s) for newforms f € Sx(1o(N)) with 
e(f) = +1 converge to the zeros of HE (—s). Moreover, for all k, N, if e(f) = 1 
(resp. e(f) = —1), then the imaginary part of the largest root is strictly bounded 
by (k — 3) (k — 4) (resp. (k — 4) (k — 3)). 


Remark 12 Theorem 11 (3) is somewhat analogous to (37). Since the zeros of Z;(s) 
are approximated by those of H;- (—s), the analog of N(T) is dictated by Theorem 11 
(1), where the largest zero has imaginary part ~ Ls or ~ E depending on the 
sign of the functional equation. Numerical investigations indicate that the locations 
of the “high” zeros are, however, somewhat differently arranged than those of the 
Riemann zeta function, the nth highest pair of zeros being at approximately height 
No/n, where No is the highest zero. 

As we shall describe shortly, the zeta polynomials Zr(s) may be thought of, via 
Theorem 11, as arithmetic-geometric Ehrhart polynomials. In this comparison, the 
combinatorial structure in Theorem 10, which we call the “Bloch—Kato complex,” 
serves as an analogue of a polytope. Assuming the Bloch—Kato Conjecture, Theo- 
rem 10 describes the values Z-(—n) as combinatorial sums of m-weighted moments 
of the jth Bloch—-Kato components. To describe this combinatorial structure, we 
made use of the Stirling numbers s(n, k) which can be arranged in a “Pascal-type” 


triangle 


1 
0 1 
0 —1 1 
0 2 —3 1 
0 —6 11 —6 1 
0 24 —50 35 —10 1 
0 -120 274 —225 85 -15 #1 


thanks to the recurrence relation 


s(n, k) = s(n—1,k-—1)-—(n—-1)-s(n—- 1,4). 
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This follows from the obvious relation 
(x)n = (X)n—1(% =Ase 1) = X(X)n-1 _ (n _ 1)(®)n-1. 


The Bloch—Kato complex is then obtained by cobbling together weighted layers of 
these Pascal-type triangles using the binomial coefficients appearing in (34). 

The connection to Ehrhart polynomials arises from the central role played by the 
H (—s) in our study of the Z;(s). In [40], Rodriguez-Villegas proved that certain 
Hilbert polynomials, such as the He (—s), which are Rodriguez-Villegas transforms 
of x‘? + 1, are examples of zeta polynomials. These well-studied combinatorial 
polynomials encode important geometric structure such as the distribution of 
integral points in polytopes. 

Given a d-dimensional integral lattice polytope in R”, we recall that the Ehrhart 
polynomial -Z, (x) is determined by 


ZL, (m) = #{p € Z": pemF}. 


The polynomials H; (s) whose behavior determines an estimate for those of Z;(s) 


(when e(f) = —1) as per Theorem 11 are the Ehrhart polynomials of the simplex 
(cf. [5]) 
k-3 
CONV 4 €1, €2,..--, Ck—-3, — 3 ej 
j=l 


We note that in Sect.1.10 of [26], Gunnells and Rodriguez-Villegas also gave 
an enticing interpretation of the modular-type behavior of Ehrhart polynomials. 
Namely, they noted that the polytopes P with vertices in a lattice L, when acted upon 
by GL(Z) in the usual way, have a fixed Ehrhart polynomial for each equivalence 
class of polytopes. Hence, these classes may be thought of as points on a “modular 
curve,” and the operation of taking the £th coefficient of the Ehrhart polynomial is 
analogous to a modular form. This analogy is strengthened as they define a natural 
Hecke operator on the set of Ehrhart polynomials, such that the ¢th coefficients of 
them are eigenfunctions. Moreover, they show that these eigenclasses are all related 
to explicit, simple Galois representations. Thus it is natural—as well as intriguing— 
to speculate about the relationship between these observations and Theorem 11. In 
particular, we have shown that as the level NV of cusp forms of a fixed weight k tends 
to infinity, the coefficients of the zeta polynomial Z;(s) tend to (a multiple of) these 
coefficients of Ehrhart polynomials considered in [26]. It is also interesting to note 
that Zagier defined (see [49]) a natural Hecke operator on the period polynomials of 
cusp forms, which commutes with the usual Hecke operators acting on cusp forms. 
Thus one may ask if there is a reasonable interpretation of Hecke operators on the 
zeta functions Z;(s), which ties together this circle of ideas. 

In the following sections, we will give a brief outline of the main underlying 
ideas of the proofs of these results. In particular, we devote the next subsection to 
some tools needed for the proofs, follow up with the proof sketches, and finish with 
several examples. 
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2.3 Tools Needed for the Proof of Manin’s Conjecture 
2.3.1 Work of Rodriguez-Villegas 


We begin our summary of key tools for the proofs of the preceding section by 
recalling the clever and useful observations of Rodriguez- Villegas in [40]. We will 
only be concerned with a specialized result; although the reader is referred to the 
original paper for a more general version and interesting mathematical context of his 
work. We first take a polynomial U(z) of degree e with U(1) # 0. The nonvanishing 
condition is non-essential, as otherwise one simply factors out all the powers of 1—z 
from the polynomial and then applies the Rodriguez- Villegas transform; however, it 
is convenient for the simplest description to have the exact degree of the polynomial 
with these factors pinned down. We then define the rational function 


-_ - @) 
P(z):= Ga-pet 


Expanding in z, we have 


P(z) = Yo hinz", 


n=0 


and it is easy to see that there is a polynomial H(z) of degree d— 1 such that H(n) = 
h, for all n. The key observation of [40] is then as follows. 


Theorem 12 (Rodriguez-Villegas) Jf all roots of U lie on the unit circle, then all 
roots of Z(s) lie on the vertical line Re(z) = 1/2. Moreover, if U has real coefficients 
and U(1) # 0, then Z(s) satisfies the functional equation 

Z(1 —s) = (-1)* '!Z(s). 
Proof The first claim is simply the special case of the Theorem of [40] when 
d = e+ 1. The second claim was described in Sect. 4 of [40], but for the reader’s 


convenience we sketch the proof. By the single proposition of [40], it suffices to 
show that P(1/z) = (—1)4x“~¢P(z). Now suppose that U factors as 


U(z) = @— pi)... @— pe), 


where each p; is on the unit circle but not equal to 1. Then 


zu(=) = (1—zp1)...(1 — Ze). 
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Since the coefficients of U are real, we have (—1)°p) (2... ¢ = 1, and dividing by 


this quantity yields 
1 —e 
Ut —) =z “U(z). 
Zz 


The claimed transformation for P then follows directly from the definition. 


2.3.2 Period Polynomials at 1 
Here, we recall a simple result that we shall need, related to the order of Ry(s) at 
‘ae 


Lemma 1 Assuming the notation above, Ry(1) 4 0 if e(f) = 1, and Ry(s) has a 
simple zero ats = 1 ife(f) = -1. 


Proof The functional equation for A(f, s) shows that 


II 


k—2 
R= ( : *)auns 1) 


j=0 


A(f.§) +2 Dicep (PAGI+ VD ife) =1, 


(40) 
A (f.5) if e(f) = —1. 

Now A(f,5s) is real-valued on the real line, and well-known work of Waldspurger 

[46] implies that A (f, ) > 0. Furthermore, Lemma 2.1 of [30] states that 


0<A(r5) sa(p5 +1) 5-5 Aged. (41) 


and that A (f, 4) = Oif e(f) = —1. So, if e(f) = 1, then the expression in the 
first case of (40) is composed of all non-negative terms, which cannot all vanish as 
it is impossible for all periods of f to be zero. Hence, in this case, Rr(1) # 0. If 
e(f) = —1, then as A (f. a) = 0, we see that Ry(1) = 0. To see that this zero is 
simple, note in a similar manner that all terms in R-() are non-negative, with the 
last term being (k — 2) A(f,k — 1). But this term cannot be zero, as the chain of 
inequalities in (41) would then imply that all periods of f are zero. 


2.3.3 Ingredients for Theorem 10 


We first describe the local conditions H; (Q,, Va(/)) for a given prime p, following 
[6, Sect. 3]. Recall that A was the prime above / in Deligne’s representation V). 
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The first case is when p = /. Here, we define 


Hy (Q,, Va (j)) := ker (Hy (Dp, Va (j)) a Ht (Dp, Va (j) ® ws)) ; 


where D, denotes a decomposition group for a prime over p. For a definition of the 
Q,-algebra Bais, we refer to Berger’s article [3, II.3]. 
For the other cases (i.e., p 4 1), we let 


Hy (Qy, VaGi)) := ker (Hy (Dp, Vii) > He Up, VaG))) 


where J, is the inertia subgroup. We let H; (Q, Vi(/)) be the corresponding global 
object, ie., the elements of H!(Q, V,(/)) whose restriction at p lies in Hy (Q,, Va (j)). 

We note that Bloch and Kato’s Tamagawa number Conjecture | is independent of 
any choices, cf. [18, Sect. 6], or for more detail cf. [6, Proposition 5.14 (iii)] and [6, 
p. 376], in which the independence of the choice in lattice in the Betti cohomology 
is discussed. 

Secondly, we describe the set of global points H%, with the appropriate Tate 
twists: 


H4() = PHO V/A) 
i 


2.4 Sketch of Proofs for Theorems on Zeta Polynomials and 
the Riemann Hypothesis for Period Polynomials 


Here we sketch the proofs of the theorems in Sect. 2.2. We begin with the proof of 
the Riemann Hypothesis for period polynomials, which is an essential ingredient in 
the main proofs about zeta polynomials. 


Proof (Sketch of the Proof of Theorem 9) The general strategy is to consider, for 
mi= ee the polynomial 


I zk m k 
PAX) = 5 (°") A (r. 5) +>. ia ) A (r. ‘ +i) xi, 
j=l 


The result then follows if the unit circle contains all of the zeros of 
T;(X) := Pr(X) + e(f)Pf(1/X). 


Following a standard trick, we substitute X > z = e!®, note that T;(i@) takes on 
only real values, and hope to find enough sign changes to locate all zeros using 
the Intermediate Value Theorem. In fact, it turns out that T;(z) is a trigonometric 
polynomial in sin or cos depending on ¢(f). For large values of k or N, results of 
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Pélya [39] and Szegé [44] then give strong conditions on the locations of the zeros, 
given unimodal properties of the coefficients of these polynomials. Overall, this 
boils the argument down (in the limit) to proving several handy inequalities for the 
competed critical L values of f. These are shown to follow from several arguments 
making keen use of the famous Hadamard factorization for A(f,s) in terms of the 
roots of A. 

This proof is only sufficient for modular forms of large weight or level. For the 
remaining cases, separate arguments can be directly constructed (for example, as 
we will look at in Sect. 2.5, if the weight is a small number like 4, the polynomial 
is only of degree 2 and can hence be studied somewhat directly). After this, finitely 
many cases remain, which may then be checked numerically, as the authors of [30] 
do using SAGE. 

We proceed to outline the proof of Theorem 7, which gives the version of the 
Riemann Hypothesis inherited by Z,(s) from the Riemann Hypothesis for Pr(s). 
Firstly, however, we need the result of Theorem 8. 


Proof (Proof of Theorem 8) This proof is a combinatorial exercise; we refer the 
reader interested in the details to [37]. 


Proof (Proof of Theorem 7) We first let 


Re (z) 


Ry (z) — Gp = Din 3 


where 6; j 1s the Kronecker delta function. By Theorem 9 and Lemma 1, we see that 
R; is a polynomial of degree k — 2 — 5_1 «f), all of whose roots lie on the unit circle, 
and such that R-(1) # 0. Thus we have 


Rez) Ry(2) 
a 


Applying Theorems 8, 9, and 12 with d = k — 1 — 6_1,.¢ yields the result, and in 
particular shows that the zeros of Z;(s) lie on the line Re(s) = 1/2. 
Proof (Proof of Theorem 10) The proof of Theorem 10 follows immediately 


from replacing the terms involving Lif,j + 1) by C(j+ 1), and appropriate 
normalizations. 


Proof (Sketch of the Proof of Theorem 11) Part (1) follows directly from the 
observation (which is easy to combinatorially verify) that the polynomials H;, (x) 
are Rodriguez- Villegas transforms of a x/ and that the He (x) are the transforms 
of x? +1. 

The fact that the zeros of H, (x) lie on the line Re(x) = 1/2 is one of the main 
examples of Theorem 12 in [5]. The precise location of its zeros in Theorem [1 is a 
direct restatement (and slight modification in the case of e(f) = +1) of Theorem 1.7 
of [5]. 
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The proof of part (2) follows directly from the more precise estimates on the 
locations (and in particular the equidistribution) of the zeros in Theorem 9, which is 
hinted at in Remark 10 and provided in Theorem 1.2 in [30]. As extracting roots of a 
polynomial is continuous in the coefficients of the polynomial, we have the desired 
convergence of the roots of Z;(s) in the limit to those of the polynomials He (x). 

Part (3) also follows by utilizing the precise estimates of Theorem 1.2 of [30] 
to determine the zeros of Ry(z) to high accuracy. The strict upper bound on the 
imaginary parts of roots follows from a careful consideration of Theorem 1 of [10], 
using the positivity properties of critical completed L-values reviewed in Lemma 1. 
When e(f) = —1, by Lemma | and the functional equation for A(f, s), we see that 
R,(x)/(1 — x) is a polynomial with all non-positive coefficients, so the Rodriguez- 
Villegas transform of Ry is the same as the Rodriguez-Villegas transform of this 
polynomial with the degree lowered by 1. This shows that we may again use 
Theorem | of [10], by applying it to the polynomial Ry(s)/(x — 1) with positive 
coefficients and whose transform has the same zeros as Z;(s). 


2.5 Examples 
2.5.1 Period Polynomials in Weight 4 


To give a flavor of the types of analysis going into the proof of Theorem 9 for small 
weights and levels, here we consider the case of weight 4 newforms. The period 
polynomial Ry is then quadratic, and we are concerned with the zeros of 


—2L(f, 1)m7X? + 2niL(f, 2)X + L(f,3) = 0. 


Clearly, we may then apply the quadratic formula. Now, the values L(f, 1) and 
L(f, 3) are also related via the functional equation for L(f, s), and the conclusion is 
thus trivial if L(f, 2) = 0. 

In the case when L(f, 2) 4 0, the theorem is equivalent to the estimate 


ete 3)” > L(f, 2)”. 
1s 


This can be shown using the Hadamard factorization of A(f, s): 


AG) = At? TT] (1 = *) exp(s/p) 


p 


Here the product is over all the zeros of A(f,s) (that is, the non-trivial zeros of 
L(f,s)), and A and B are constants. Now we always have 3/2 < Re(p) < 5/2, which 
means that A(f,3) > A(f, 2), and is enough to imply the desired inequalities. 
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2.5.2 Zeta Function for the Modular Discriminant 


It is interesting to consider the first level 1 newform, namely the normalized Hecke 
eigenform f = A € S}o(/0(1)). In this case, e(f) = 1 and we have 


36 36 
Ra(z) © 0.114379 + | ——z!° + 2 4 32° + 324 + 2 + —— 
A(z) ® 0.11437 (a +2432 4+ 32° +2 + 2] 


+0.00926927 - (42? + 252’ + 42z° + 2523 + 4z). 
The ten zeros of Ry lie on the unit circle, and are approximated by the set 
{i, —0.465 + 0.8857, —0.744 + 0.6687, —0.911 + 0.4117, —0.990 + 0.140i} . 


These are illustrated in the following diagram (Fig. 1). 

It is quite tempting from this picture to conjecture that the zeros of Ry always lie 
on one half of a unit circle. However, this small example is somewhat misleading: 
the zeros for large level or weight modular forms eventually have to equidistribute 
on the whole unit circle, as shown in [30]. Another manner in which this example 
is atypical is the following. The size of the coefficients of Rr follows a unimodal 
pattern. The same equidistribution-of-zeros result shows that eventually the coeffi- 
cients of Ry must be unimodal, but in the opposite direction! In fact, if the direction 
of this unimodality of coefficients were reversed, the Riemann Hypothesis for Ry 
would follow directly from its functional equation (showing it is a self-inversive 
polynomial) along with elementary results such as Theorem 1 of [14]. It is this 


Fig. 1 The roots of Ry (z) 
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Fig. 2 The roots of Z,(s) i 
5L $ 
+ i Ff 
-0.5 Ot 1 1.5 
“BL 4 


eventual “correct” unimodality that follows from the results of [30], and this drastic 
switching of phenomena that make the proof of Theorem 9 split into several cases 
and make its statement surprising. In fact, this line of reasoning allows one to 
conclude that for large weights and levels, the location of the zeros of Rr(s) on the 
unit circle is a relatively “stable” phenomenon, while a quick numerical experiment 
for a polynomial such as R, shows that the location of its zeros on the circle is very 
unstable, and hence rather astounding. 

We now consider the Rodriguez- Villegas transform; letting s +> —s we find that 

Za(s) © (5.11 x 1077)s!9 — (2.554 x 107%)5? + (6.01 x 107>)s8 — (2.25 x 1074)57 

+0.00180s° — 0.00463s° + 0.0155s* — 0.0235s° 


+0.0310s” — 0.0199s + 0.00596. 


Theorem 7 establishes that the zeros p of Z, satisfy Re(o) = 1/2; indeed, they are 
approximately 


1 1 1 1 1 
5 + 8.447i, 5 + 5.002i, 5 + 2.8461, 5 + 1.352i, 5 + 0.3491, ¢ , 


as illustrated in the next figure (Fig. 2). 


2.5.3 Ehrhart Polynomials and Newforms of Weight 6 


Here we consider newforms f € S6(1o(N)) with e(f) = —1. By Theorem 11 (2), the 
roots of Z(s) may be modeled (with improving accuracy as N — oo) on the roots 
of the Ehrhart polynomial of the convex hull 


conv {é1, €2, €3, —€1 — €2 — e3}. 


The following image displays this tetrahedron. 
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Fig. 3. The tetrahedron 7 
whose Ehrhart polynomial is 
Hg (s) 


z=0 


The corresponding Ehrhart polynomial counts the number of integer points in 
dilations of Fig. 3, and is given by the Rodriguez- Villegas transform of 1+-x+x?+2°. 
Namely, we have 


= s+3 s+2 stl RY 2, > 7 
H = == a 1. 
6 (S) @hesneene go Pe eet 


Therefore, we find that 
~ ~ 1 1 V-I11 1 <-l1 
an Zp (s) = He (—s) = (s- ) (:- + (:- - ). 
—>+00 


where the limit is over newforms f € S6(Jo(N)) with e(f) = —1, and where the 
polynomials with tildes have been normalized to have leading coefficient 1. 


2.6 Concluding Remark 


In both this and the previous section, we have surveyed a number of natural 
partition-theoretic forms emerging from the same stellar nursery as the Riemann 
zeta function. Indeed, it is evident that classical zeta functions represent a relatively 
small subclass in the vast universe of partition-theoretic objects. 
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Reading Riemann 


S.J. Patterson 


Abstract This is an essay about the reputation that Riemann has acquired as a 
difficult writer, a reputation which has, if anything, grown over the years. An 
awareness of the developments in mathematics, and, in particular, in mathematical 
physics at that time shows that Riemann was working in a very specific environment 
and that his thinking reflects this environment. It is therefore helpful to know 
something of this background. Riemann’s short paper on the distribution of prime 
numbers has been the subject of considerable discussion and we shall consider this 
paper in particular. 


1 Introduction 


Riemann has been considered a “difficult” author with some commentators on the 
verge of considering him to be a mathematical Heraclitus whose writings were 
described as “dark” in classical times (at least by Cicero). One particularly flagrant 
example of this is Lars Ahlfors’ discussion of Riemann’s dissertation in a volume 
published to mark its centenary.! Ahlfors writes: 


The most astonishing feature in Riemann’s paper is the breathtaking generality with which 
he attacks the problem of conformal mapping. He has no thought of illustrating his methods 
by simple examples which to lesser mathematicians would have seemed such an excellent 
preparation and undoubtedly would have helped his paper to much earlier recognition. On 
the contrary, Riemann’s writings are full of almost cryptic messages to the future. For 
instance, Riemann’s mapping theorem is ultimately formulated in terms which would defy 
any attempt of proof, even with modern methods. 


'L. Ahlfors, Developments of the theory of conformal mapping and Riemann surfaces through a 
century, In: Contributions to the theory of Riemann surfaces, Eds. L. Ahlfors, E. Calabi, M. Morse, 
L. Sario, D. Spencer, Annals of Mathematics Studies, 30, Princeton UP, 1953. 
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Another curious example is a statement by Hans Freudenthal* 


Riemann’s style, influenced by philosophical reading, exhibits the worst aspects of German 
syntax; it must be a mystery to anyone who has not mastered German. 


This has to be contrasted with a statement in the preceding paragraph of the same 
article: 


Riemann’s evolution was slow and his life short. What his work lacks in quantity is more 
than compensated for by its superb quality. One of the most profound and imaginative 
mathematicians of all time, he had a strong inclination to philosophy, indeed, was a great 
philosopher, Had he lived and worked longer, philosophers would acknowledge him as one 
of them. His style was conceptual rather than algorithmic and to a higher degree than that 
of any mathematician before him. He never tried to conceal his thought in a thicket of 
formulas. After more than a century his papers are still so modern that any mathematician 
can read them without historical comment, and with intense pleasure. 


To the present author’s taste Riemann’s writing is clear. However, it cannot be 
denied that there are difficulties, but this is by no means uncommon in reading 
authors of a different epoch. Many authors, and Riemann is included here, do not 
give references for all the background material that is used in a paper. Indeed the 
ideas may well be “folk-lore” and this is by no means limited to historical authors— 
as one well knows. Riemann was a mathematician, or, perhaps more precisely, a 
mathematical physicist, who was mathematically ambitious and employed the most 
modern tools at his disposal. The present-day reader is confronted with the task of 
reconstructing the knowledge and thought-patterns of a sophisticated mathematician 
of the 1850s and 1860s. It is not possible to understand Riemann in isolation— 
despite what Freudenthal writes at the end of the second passage quoted; both the 
group of older mathematicians in Gottingen and Berlin who influenced him and the 
published literature of the time are the context of Riemann’s work. 

Ahlfors indicates that in his dissertation Riemann did not give examples; what 
he leaves to one side is whether Riemann thought in terms of examples. In the 
case of the dissertation he took over the work of his predecessors, particularly 
on the geometrical side Gauss’ “Allgemeine Auflésung der Aufgabe die Theile 
einer gegebenen Flache etc.” (1822/1825)° and his geometrical conceptions started 
from here.* He was also clearly familiar with Cauchy’s work, much of which was 
decades old by this time. In other matters, for example, in the theory of non- 
linear hyperbolic differential equations and the theory of shock waves, Riemann 


?Riemann, Georg Friedrich Bernhard, Complete Dictionary of Scientific Biography, 2008 Charles 
Scribner’s Sons. 

3Gauss, Werke 4, 189-216. 

4See Henri Paul de Saint-Gervais, Uniformization of Riemann Surfaces, Revisiting a hundred-year- 
old theorem, European Mathematical Society, 2016, esp. Ch. 1. 


>For the chronology of Cauchy’s work see F. Smithies, Cauchy and the creation of complex 
function theory, Cambridge U.P., 1997. 
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had no compunction about working from an example. Indeed he writes in the 
“Selbstanzeige der vorstehenden Abhandlung’”® 


Wie fiir die Integration der linearen partiellen Differentialgleichungen nicht durch die 
Entwicklung des allgemeinen Begriffs dieser Aufgabe gefunden worden, sondern vielmehr 
aus der Behandlung specieller physikalischer Probleme hervorgegengen sind, so scheint 
auch fiir die Theorie der nichtlinearen partiellen Differentialgleichungen durch eine einge- 
hende, alle Nebenbedingungen beriicksichtigende, Behandlung specieller physikalischer 
Probleme am meisten geférdert zu werden, und in der That hat die Lésung der ganz 
speciellen Aufgabe, welche Gegenstand dieser Abhandlung bildet, neue Methoden und 
Auffassungen erfordert, und zu Ergebnissen gefiihrt, welche wahrscheinlich auch bei 
allgemeineren Aufgaben eine Rolle spielen werden.’ 


Generally speaking the views expressed by mathematicians as to their principles or 
methods frequently diverge from their ceuvre as a whole and represent their thoughts 
simply at the time of writing. We recall in this context that Klein was very impressed 
by how elegantly Riemann’s theory of the P-function led to Kummer’s results on 
the hypergeometric function and stressed the geometrical and conceptual aspects 
of his function theory. C.L. Siegel, on the other hand, in his paper on the Riemann 
Nachlass® stresses Riemann’s formal virtuosity, an aspect which has been reinforced 
by recent work of W. Gabcke (private communication). No mathematician of 
Riemann’s stature can be categorized’ by one or two simplistic phrases. 

Riemann’s dissertation and most of his other published mathematical papers were 
taken up relatively soon by young mathematicians like H.A. Schwarz, C. Neumann 
and later, from about 1880 onwards, by F. Klein and H. Poincaré; his work on 
theta functions and other aspects of the theory of algebraic functions was taken 
up by J. Thomae, G. Roch, A. Clebsch, P. Gordan, and others. Indeed Riemann’s 
theory ousted K. Weierstrass’ more algebraic approach to the theory of abelian 
functions which meant that the construction of an algebraic theory, which made 
no use of analytic techniques, was only completed in the 1940s, mainly by A. 
Weil.!° Riemann’s work on trigonometrical series was taken up by G. Cantor at 


STI, p. 208]. 

TTL, p. 208], p. 208—in English “Just as the integration of linear partial differential equations was 
not found by developing general concepts but rather through the treatment of special physical 
problems, it seems that also the theory of non-linear partial differential equations will profit most 
from a deeper treatment of special physical problems involving all aspects of those problems and 
indeed the solution of the problem which is the subject of this memoir has led to the development 
of new methods and concepts and to results which will probably play a role in more general 
investigations”. 

81, pp. 770-805, esp. p. 771]. 

°As an historical aside the word “category”, or rather its Greek precursor, was first used in its 
present sense by Aristotle and originally meant “accusation”. It is derived from “kata”, down, and 
“agora”, the market place/assembly place. 

OAs it happens Weil does not seem to have had any interest in Weierstrass’ work. In his 
thesis, A. Weil, L’arithmétique sur les courbes algébriques, Acta Math. 52 (1928) 1-35(= Geuvres 
scientifiques,I,11-45) he uses the theory of theta functions to develop what he needs about the 
Jacobian variety. See p. 26ff. C.L. Siegel who followed this up a few years later with his theorem 
on integral points on curves, in tiber einige Anwendungen diophantischer Approximationen, Abh. 
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the suggestion of Weierstrass and was the subject of many investigations. Likewise 
Riemann’s theory of linear and non-linear waves in air was taken up in the 1870s 
by Rankine and Rayleigh. His work in general was not seen as incomprehensible, 
merely as a challenge to the upcoming generation of mathematicians. Indeed W. 
Purkert!! has listed over 550 papers influenced by Riemann which appeared in 
the 25 years after his death, i.e. to the point when Riemann would have turned 
65. It should be added that Riemann would not have been appointed as Dirichlet’s 
successor in Gottingen if his work had been a collection of “cryptic messages to the 
future”. 

The role of Felix Klein in understanding Riemann is important. He “discovered” 
Riemann for himself around 1880 and his little book on Riemann’s function 
theory!” shows in detail how Klein approached Riemann’s work. He had from 
then on the enthusiasm of a convert. When after his collapse of 1882 he went in 
1886 to Géttingen it was to build up that university as a “centre of excellence” 
in mathematics inside the Prussian state. Klein used the myths of both Gauss 
and Riemann to further this project and had a very long-lasting influence on the 
perception of both mathematicians. '* 

Riemann’s paper on prime numbers seems to be something of an exception in 
that it took over 30 years before J. Hadamard made the first substantial progress. !* 
Riemann’s work was by no means forgotten in the intervening years and his ideas 
were quoted by a number of authors. There seems to have been a particular revival 
after the publication of Riemann’s Gesammelte mathematische Werke in 1876 but it 
took some time before this interest bore fruit. 

Before this event there had been a considerable interest in prime numbers 
both with Chebyshev’s work and its further developments, and in the calculation 
of extensive tables. H.J.S. Smith, who was very familiar with the contemporary 
continental mathematical literature, wrote in his Presidential Address to the London 
Mathematical Society in 1876, so around the time that Riemann’s Werke appeared, 
the following passage’: 


d, PreuBischen Akad. d. Wiss., Phys.-math. Klasse, 1929, Nr. 1 =(Ges. Abh. I, 209-266) used the 
same methods—see p. 44ff. When Weil came to develop the theory of Jacobian varieties he seems 
to have taken his thesis as his starting point. Siegel in later years was much more interested in 
Weierstrass’ work but was never interested in the general theory of Jacobian varieties. 


"11, p. 870ff.]. 


°F Klein, On Riemann’s theory of algebraic functions and their integrals, Dover, 1963—German 
edition, 1881, English translation, 1893. 

'3 Apart from his booklet on Riemann’s function theory Klein also wrote a very interesting essay 
on Riemann, delivered as an address in to a meeting of the Naturforscherversammlung zu Wien 
on Riemann’s birthday, 1894—see his Gesammelte Mathematische Abhandlungen, Bd. 3, pp. 482— 
497. 

'4For the history we refer the reader to Landau’s Handbuch. 


'SThis address was read on the 9th November 1876, so late in the year. He may therefore already 
have been in possession of a copy of the Werke. On the other hand, we know that Smith was very 
well informed about developments in number theory outside the UK as his Reports to the British 
Association testify. 
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As to our knowledge of the series of the prime numbers themselves, the advance since the 
time of Euler has been great, if we think of the difficulty of the problem; but very small if 
we compare what has been done with what still remains to do. We may mention, in the first 
place, the undemonstrated, and indeed conjectural, theorems of Gauss and Legendre as to 
the asymptotic value of the number of primes inferior to a given limit x (the former theorem 
assigns for this value the integral logarithm Li(x) = i ae the latter an expression of 
the form ingxma)" But these theorems are only approximately consistent one another, and 
are perhaps still less approximately true. The memoir of Bernhard Riemann, “Ueber die 
Anzahl der Primzahlen unter einer gegebenen Grosse”, contains (so far as I am aware) 
the only investigation of the asymptotic frequency of the primes which can be regarded as 
rigorous. 


It is clear that the word “rigorous” here has a rather different meaning to that 
used today. It is fairly clear that Smith does not mean “indisputable” but rather 
“theoretical” (or even ‘heuristic’) in contradistinction to “empirical”. These notions 
were not used then in the strict sense we employ today. 

If one examines the development of complex analysis in the second half of the 
nineteenth century, then one sees that it took quite some time before many of the 
ideas inherent in Riemann’s papers, for example, a flexible use of contour integrals, 
of which the Pochhammer integrals, due to C. Jordan (1887) and L. Pochhammer 
(1890) or the use of the principle of the argument!® only slowly became common 
knowledge and practice. These developments had to be achieved before the paper 
on prime numbers could be properly digested. We shall see that it was especially 
Riemann’s formula for N(T) that caused difficulty at the time, much more so than 
the actual calculation of the zeros. 


2 Prime Number Theory Before Riemann’s Paper 


The first published discussion of the distribution of the primes, apart from Euclid’s, 
was, as is well-known, in the first edition (1798) of Legendre’s Essai sur la théorie 
des nombres in which he suggested the well-known asymptotic law 2(X) ~ ne 
where, as usual, 2(X) denotes the number of primes < X. This approximation 
is numerically not entirely convincing and Legendre attempted to find a better 
approximation but his attempt went in the wrong direction. The attempt reflects 
the fact that the approximation a is numerically not all that good; indeed the 
approximation by the logarithmic integral is also not particularly good. After 
Legendre’s work there was, for a long time, little published on these questions. The 


next major advance in the theory of primes was Dirichlet’s theorem on primes in 


‘©The principle of the argument seems to have been very little used in the second half of the 
nineteenth century despite the fact that it is a part of Riemann’s approach to the theory of theta 
functions. For some details of the history see F. Smithies, /oc. cit. p173ff. It seems to have attracted 
little attention for quite some time; apparently a topological approach was unfamiliar to most 
mathematicians of the early and middle nineteenth century. 
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arithmetic progressions. This proof, presented to the Berlin Academy in 1837 and 
published in 1839, was a remarkable achievement. Dirichlet expected that he would 
be able to prove Legendre’s empirical law by the same methods (see the discussion 
in Landau’s Handbuch) but this proved illusory. Ten years later the young Riemann 
spent 2 years in Berlin and seems to have learnt of Dirichlet’s hopes. 

In 1848 the first real progress towards a proof of what we may call “the prime 
number theorem” was made by Chebyshev. His first paper on the subject seems 
to have been read to the Petersburg Academy on 24th May 1848 and published in 
Russian in 1849 and in French in 1851; it was reprinted in Liouville’s Journal de 
mathématiques pures et appliquées in 1852. In this paper Chebyshev uses, as one 
may formulate it, the theory of the zeta function as a function of a real variable 
combined with partial summation to show that if om x tends to a limit, then this 


limit is 1. He also discusses Legendre’s empirical law and argues that the better 
x 


approximation should be Li(X) rather than ioeX* In a second paper which appeared 
in Liouville’s Journal in 1852 directly following his first paper, but only 2 years later 
in the Mémoires of the Petersburg Academy he introduces his method of combining 
the prime decomposition of n! with Stirling’s formula to prove Bertrand’s Postulate. 

More or less simultaneously with Chebyshev’s work an English barrister and 
professor of law at University College, London, Charles J. Hargreave published a 
paper in the Philosophical Magazine in which he applies arguments similar to those 
of Chebyshev’s first paper to the distribution of prime numbers.” It seems that this 
was entirely coincidental. Hargreave’s paper is in parts rather verbose and by no 
means so elegant or profound as that of Chebyshev, which would have difficult for 
anyone to achieve, but he too came to the conclusion that Li(X) should be the better 
approximation to z(X) and discusses numerical evidence to support this thesis. 

On Christmas Eve, 1849 Gauss wrote a letter to his erstwhile student Johann 
Franz Encke (1791-1865) who was at this time a well-established professor of 
astronomy in Berlin and director of the observatory there. In this letter, which 
has been printed in Vol. 2. of Gauss’ Werke, Gauss explains how he has begun to 
investigate the distribution of the primes in 1792 or 1793 and had come to similar 
conclusions. One may well ask, why he did this at this point in time and why in a 
letter to Encke, who, as far as we know, had little interest in the theory of numbers. 

Gauss had a tendency to sit on his more mathematical results for long periods of 
time until someone else had either rediscovered them or was close to doing so. He 
behaved in a very ungenerous fashion towards Legendre as regards the method of 
least squares and towards Janos Bolyai, the son of his closest friend from his student 
days, in the matter of non-euclidean geometry. He behaved in a similar fashion 
towards Jacobi and Abel when they developed the first theory of elliptic functions 
and towards Jacobi when he constructed Jacobi sums and began to make progress 
towards the biquadratic and cubic reciprocity laws. Jacobi, who had a sharp tongue, 
made his irritation about Gauss’ behaviour clear. In a letter to Bessel from 3rd April 
1835 he wrote “Bei Gauss heisst es nicht: de mortuis nil nisi bene, sondern de 


"Nol. 35 (1849) 36-53. This paper is in the bibliography of Landau’s Handbuch. 
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mortuis et de vivis nil’, i.e. “Gauss’ motto is not “nothing but good about the dead” 
but rather “nothing about the dead or the living”.'* He also wrote to his brother 
Moritz Jacobi on the 21st September 1849,“Er [Gauss] hat in den 20 Jahren weder 
mich noch D[Dirichlet] jemals citiert”, i.e. He [Gauss] has not cited either me or 
Dirichlet in the [last] 20 years’”’.!° Gauss attempted to rationalize his behaviour from 
time to time, for example, with his motto pauca sed matura but his explanations are 
unconvincing. Rather Gauss was reluctant to share with others the renown coming 
from certain advances. In this he was not unlike Newton in his relationship to Hooke 
and to Leibniz. 

We do not know whether Gauss had learnt of Chebyshev’s work by the end 
of 1849. He had had a number of contacts with Petersburg but there seems to be 
nothing in his papers that would decide the question one way or the other. However, 
the Philosophical Magazine, in which Hargreave’s article appeared, was subscribed 
to in Gottingen and we know that Riemann, at least, read articles in it.2? We can 
conclude that Riemann had at least seen this paper and that Gauss had had the 
opportunity to see it. 

Chebyshev visited Berlin in the summer of 1852 and conferred with Dirichlet. 
Later that year Dirichlet spent some time in Géttingen, as we know from Dedekind’s 
Lebenslauf of Riemann. He spent time together with Riemann and so one can 
presume that Riemann was also informed about Chebyshev’s work at the latest 
by 1852.7! It seems therefore reasonable to conclude that the seeds for Riemann’s 
thinking about primes had been sown by then. 


3 In the Intermezzo 


Between Riemann’s paper and Hadamard’s work on the zeta function more than 30 
years elapsed. During most of this period one can find signs of interest in Riemann’s 
paper. A. Genocchi published a Italian version of Riemann’s paper in 1860.7” This 
is an interesting reworking of Riemann’s explicit formula. Genocchi discusses the 
experimental evidence for the prime number theorem. It seems as if he would have 
been dissatisfied with an error term as large as O(X z), In this he is reflecting 
Riemann’s own opinion expressed in the last paragraph of his paper where he also 


18K _-R. Biermann, Die Beziehungen zwischen C.F. Gauss und F.W. Bessel, Mitt. Gauss-Ges. 
Gottingen, 3 (1966) 7-20, esp. p. 18. 

loc. cit. 

0See his “Selbstanzeige der vorstehenden Abhandlungen”, [1, p. 208]. In fact one of the articles 
quoted there is in the same volume as Hargreave’s paper. 

21 See also Harold Edwards, Riemann’s zeta function, p. 5. 


224. Genocchi, Fermole per determinare quanti siano i numeri primi ad un dato limite, Ann. di 
Mat. pura et applicata, Ser. 1, 3 (1860) 52-59. 
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tries to explain why Li(X) — 2(X) is always positive.*> These considerations were 
rendered null and void by Littlewood’s theorem of 1914** in which he showed that 
the sign of the difference changes infinitely often. Genocchi does not discuss the 
location of the zeros. 

From 1874 onwards F. Mertens published a substantial series of papers dealing 
with prime number theory. At this time there were further, and more accurate, 
tables produced of prime numbers in which the validity of the conjectured prime 
number theorem was tested. Some felt the deviations were too large and looked for 
alternatives. At the latest after Riemann’s Werke appeared the function discussed 
there, namely Li(x) — 1Li(x?) - 1Li(xs) _ 1Li(xs) + 1Ti(xs) — +++ was examined 
to see if it were a better approximation, which, in the ranges considered, it appears 
to be. 

From the middle of the 1880s on some Danish mathematicians began to compute 
the zeros of the zeta function on the critical line. First, in a rough and unpublished 
calculation E. Meissel”° calculated a number of terms in the Taylor expansion of & 
around s = 5 and used this to compute some zeros. This computation was effected 
by carrying out a numerical integration of using Riemann’s integral representation 
of this function. Meissel also computed 2 (X) for some large values of X. 

Around the same time J.L.W.V. Jensen’’ used the expansion 


= 1 tas l-s 
(s—1t(s)-1= om (1+-) -1- rr 


m>1 


which converges in Re(s) > O and can be used to give an “elementary” analytic 
continuation of the zeta function, to compute the first 9 terms of the expansion 
around s = 1. He derived from this a rough estimate for the first zero. 

Later, in 1895, Gram?® used the formula, valid in Re(s) > 0, 


?3 At this point Riemann also describes the approximation Li(X) to 1(X) as “known”. 


4. E. Littlewood, Sur la distribution des nombres premiers.C. R. Acad. Sci. Paris 158 (1914), 
1869-1872. 

25See W.W. Johnson, Mr James Glaisher’s factor tables and the distribution of primes, Ann. of 
Math. 1 (1884) 15-28. 

?6See S.J. Patterson, The Riemann Hypothesis — a short history, In: Casimir Force, Casimir 
Operators and the Riemann Hypothesis, Eds. G. v. Dijk and M. Wakayama, de Gruyter, 2010, 
pp. 29-41. 

27)_L.W.V. Jensen, Sur le fonction ¢(s) de Riemann, Comptes rendus Acad. Sci. Paris 104 (1887) 
1156-1159. In J.-P. Gram, Note sur la calcul de la fonction ¢(s), Oversigt over det Koniglige 
Danske videnskabernes Selskabs Forhandlinger og dets Medlemmers Arbejter, 1895, 303-308, the 
author reports that he had received in 1887 a letter from T.J. Stieltjes who had also undertaken 
calculations and had found the approximation 14.5 for the first zero. 


Bloc. cit. 
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combined with the Euler-Maclaurin for the same purposes. Later, in 1903, he 
augmented this with a summation formula due to Abel. The Finnish mathematician 
Ralf Josef Backlund took up the idea in his thesis of 1916 but using a variant 
summation formula due to Lindeléf. 

It seems as this work was the inspiration for Hardy and Littlewood’s “approx- 
imate function equation” of 1923. As it turned out Riemann had already found a 
more precise formula but which he had not published.”? 

What these authors were not in a position to do was to verify that all the zeros of 
the zeta function, in the regions examined, were on the critical line. It was only after 
v. Mangoldt gave a proof of the precise version of Riemann’s estimate for V(T) in 
1905 that it became possible to test the Riemann Hypothesis. He had given a weaker 
version in 1895 but apparently no-one attempted to develop it numerically. The tests 
of the Riemann Hypothesis began with the work of Backlund who developed an 
effective version of v. Mangoldt’s formula (1918). 

Finally it is of some interest to read what a well-educated young mathematician, 
G.B. Mathews (born in 1861) wrote about Riemann’s paper in 1892, that is, just 


before Hadamard’s contributions began*”: 


[That the formula 7(X) = Li(X) ] is approximately [i.e. empirically] correct, does not 
in any way prove that this is a proper asymptotic formula. It seems clear that Gauss was 
led to it simply by observation, and it does not appear that he ever accounted for it in any 
theoretical way, The only satisfactory attempt to determine a general analytical formula 
for F(x) [i.e. 1(X)] appears to be that contained in Riemann’s celebrated memoir. This is 
confessedly incomplete, and the analysis which it contains is very peculiar and difficult: but 
because of its great importance, some account of it ought to be given. On the properties of 
the function I"(z) for a complex variable, which will have to be assumed in the course of 
the investigation, the reader may consult Prym, Zur Theorie der Gammafunction (Crelle, 
Ixxxii. 165) and various papers by Bourguet and Mellin in vols. i, ii, 11i, and viii of the Acta 
Mathematica.*! 


One can summarize these authors by saying that Riemann’s work was known and 
appreciated by a fair number of people in the period before the work of Hadamard 
but that it was not regarded as the last word, perhaps in the way that certain 
procedures are used in physics now which one knows work very well but where 
a proper deductive foundation is missing. It is also clear that before Hadamard none 
of these authors had gone beyond what Riemann had done. In the intervening years, 
however, the techniques of complex analysis had been developed and were available 
to a much larger group of mathematicians. 


°Carl Ludwig Siegel, iiber Riemanns Nachlass zur analytischen Zahlentheorie, Quellen und 
Studien zur Geschichte der Mathematik 2 (1932) 45-80, reprinted in [1, pp. 770-805]. Recently 
Wolfgang Gabcke has re-examined Riemann’s papers and reconstructed more of Riemann’s 
arguments. 

30Hadamard’s first paper on the zeta function was submitted for a prize offered by the Académie des 
Sciences de Paris in December, 1892 and published in 1893. For the history of Hadamard’s work 
see V. Maz’ya, T. Shaposhnikova, Jacques Hadamard, A Universal Mathematician, AMS/LMS 
History of Mathematics Series, 14, AMS, 1998, p. 53 ff. 


31GB. Mathews, Theory of Numbers, Second Edition, Deighton Bell, London, 1892, Sect. 233. 
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Some attempts were made to go further. Stieltjes made, in 1885, the first 
unjustified claim to have proved what is now called the Riemann hypothesis which 
shows that this was already considered to be a significant open question. What was 
done at this time has been absorbed into the corpus of analytic number theory and 
we refer to Landau’s Handbuch for the historical details. 


4 Riemann’s Mode of Writing 


We have seen that many authors have found difficulty in unravelling some of 
Riemann’s arguments. There seem to be a number of reasons for this. Klein writes, 
rather darkly, of “Anyone who clearly understands under which Riemann worked 
in Gottingen...”.°? Some of his difficulties are known, for example, the poverty 
and the illnesses plaguing his family. Klein’s formulation indicates, however, that 
some of the difficulties were associated with his life in Gottingen and here we have 
only partial information. He clearly had difficult phases but his wife, Elise, said that 
it was most inaccurate to represent Riemann as a hypochondriac and depressive.** 
His gloomy phases, like those of Eisenstein, seem to have been also to a large extent 
the result of external factors about which we have relatively little information. These 
phases, Klein suggests, seem to have made Riemann even slower at writing up his 
ideas than he might have been under more favourable circumstances. 

Leaving this aside there is one aspect that presents itself repeatedly. This is 
that Riemann does not indicate what mathematics he is assuming. Thus, in his 
doctoral thesis, he is assuming a knowledge of topology of some sort which he 
does not seem to feel needs justification. There was at this time just one book 
on topology, Johann Benedikt Listing’s Vorstudien zur Topologie. Listing was a 
physicist alongside Wilhelm Weber in Géttingen. The Physikalisches Kabinett 
neighboured the university library and so Riemann must have encountered Listing, 
who, like Weber, seems to have been a gregarious person, often. While Riemann was 
a student Listing, a polymath of quite remarkable breadth, lectured on such topics 
as machines, steam engines, and meteorology. He is best known, in appropriate 
circles, for his work on physiological optics (he wrote the first treatise on this topic). 
Later he accompanied Sartorius von Waltershausen, Gauss’ first biographer, on a 
vulcanological expedition to carry out measurements of the magnetic field near a 
volcano. He gave the concept of the equipotential definition of the surface of the 
Earth, due to Gauss, Stokes, and others, the name “geoid”. He also introduced the 
word “topology” (or rather “Topologie”) into mathematics. The phrases analysis 


>2See F. Klein, loc. cit., Preface. 
33Katrin Scheel, Der Briefwechsel Dedekind-Weber, de Gruyter, 2014, p. 310. 
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situs OY geometria situs was sometimes used in the same context. This phrase had 
been introduced by Leibniz, but with a different meaning—see Tait’s discussion of 
this point.** 

In his articles on Listing P.G. Tait’? has expressed the expectation that there 
would be much more in Listing’s papers on topology than was published in his 
two books but unfortunately his papers have never been edited.*© 

The relationship between Listing and Riemann is, unfortunately, not reflected in 
the papers which have been preserved and published but Listing did leave diaries 
in which his contacts are documented but these have not yet been made available. 
One can only surmise that Listing was the source of the topology Riemann assumes 
but even so this topology does seem to go beyond the Vorstudien.*’ Listing did not 
lecture on topology*® (at least not officially) and we remain rather in the dark as to 
whether Riemann had direct contact with him in his younger years or whether he 
had simply studied Listing’s book carefully. The anecdote recounted by his teacher 
Schmalfuss*? shows how quickly Riemann could absorb and retain mathematics 
from a written source. 

Riemann’s laxity in naming his sources seems to be the best way of interpreting 
an incident which occurred while Riemann was preparing his Habilitationsschrift on 
trigonometric series. The first section gives an interesting and illuminating survey of 
the theory up to that point. However, Riemann records here*? that Dirichlet provided 
him with a list of the literature which saved him a great deal of work. Riemann 
was, as one can see from his works, well-read. It would seem that he could easily 
remember the content but forgot the source. This “help” from Dirichlet may have 
been a fairly heavy hint that he should mend his ways (and perhaps not adopt Gauss 
as a role-model). 

The problem in reading Riemann is then in finding out what he knew and what 
he was using. There are two points one should bear in mind. First of all that 
he belonged to a circle of physicists in Géttingen and that he was very familiar 
with the literature of mathematical physics. The second point is that one can also 
gain some information about the literature available to Riemann in Gottingen. The 
historical collection in the Staats- und Universitatsbibliothek in Géttingen seems to 
still contain what was available in Riemann’s day. On top of the literature, during 


> 


34PG. Tait, Scientific Papers, I, pp. 85-86. 

35See P.G. Tait, loc. cit., Papers LXV and LXVI. 

36The editor of Johann Benedikt Listing, Beitrag zur physiologischen Optik, Ostwald’s Klassiker, 
147, Leipzig 1905, Otto Schwarz has expressed similar views in the area of physiological optics, 
see p. 51. 


37This book influenced J. Clerk Maxwell and P.G. Tait who learnt of it from Maxwell, see P.G. 
Tait, loc. cit. p. 86. Its significance is also made clear in the appropriate footnotes in I. Lakatos’ 
Proofs and Refutations, Cambridge U.P., 1976. 


38The lecture lists of this period were published in the Géttinger Nachrichten and Listing lectured 
on such topics as the theory of machines, on steam engines, and on meteorology. 


3°11, p. 852; also pp. 543, 829-830]. 
See [1, p. 578]. 
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his student years in Berlin, it seems to have been the lectures of Jacobi and Dirichlet 
that impressed Riemann most. There is no evidence of any kind of closer contact of 
Riemann to Jacobi but there is to Dirichlet. Since Dirichlet had over many years 
close connections to French mathematicians one may assume the Riemann was 
informed about their work even when this was not directly available to him. In the 
historical discussion in Riemann’s Habilitationsschrift there are many references to 
the French and Italian literature on the theory of Fourier series*! presumably from 
Dirichlet’s notes. 

One case where this is some interest to us is the long memoir of Binet (1839) on 
the gamma function. In particular Binet’s integral formula for log(J"(s)) gives the 
complex version of Stirling’s formula in a very usable form.” 


5 The Paper on Prime Numbers 


The paper on prime number can be considered as in three parts. The central idea is 
that there is a duality, given by an integral kernel, between the set of prime numbers 
and a certain set of complex numbers, the zeros of the zeta function. This is a very 
general theorem involving techniques applicable in other contexts. It may be seen 
as in the context of Riemann’s work on trigonometric series. 

We may see Euler’s formula 


sin(zz) = mz [[a —2/n’) 


m>1 


as the model for the introduction of the zeros of the zeta function; the ideas needed 
for the analytic continuation of the zeta function had been around for some time 
although it was Riemann who clarified the notion of “analytic continuation” in his 
thesis.** With the tools he had available from complex analysis it was much easier to 
prove the product expansion than it had been for his predecessors, especially those 
of the eighteenth century. 

The Euler product for the zeta function, Euler’s proof of the infinitude of 
primes, and the theory of exponential and trigonometric functions, including the 
product formula for the sine function all appear in Euler’s Introductio in analysin 
infinitorum. The logarithmic derivative of the two sides of the product formula, a 
little rewritten, gives 


41 Apparently Riemann did not intend to publish this work; it appeared posthumously through the 
offices of Dedekind a year after Riemann’s death. It has, of course, been extremely influential. 
See E.T. Whittaker and G.N. Watson, Modern Analysis, Cambridge U.P., pp. 248-251. 

*®See A. Weil, On the prehistory of the zeta-function, In: Number Theory, Trace Formulas and 
Discrete Groups, Symposium in Honor of Atle Selberg, Eds. K.E. Aubert, E. Bombieri, D. Goldfeld, 
Academic Press, 1989, 1-9. 
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a formula which is a consequence of the Poisson Summation Formula. We note, in 
passing, that the easiest determination of the Fourier transform of t b> 24 is by 
way of the calculus of residues; another is by way of the formula above. The use of 
special cases such as this (often in the form of kernels) to study general Fourier and 
trigonometric series was clear to Riemann, as his Habilitationsschrift shows. 

Euler’s first attempt at a proof of the product formula for the sine function was to 
regard the sine function as a generalized polynomial and to equate it with the product 
over the roots. Euler was, quite rightly, dissatisfied with this argument and in the 
Introductio gives a convincing proof. Riemann, like all serious analysts of his time, 
would have been familiar with the Introductio and it would presumably have struck 
him that Euler’s original argument could, in the right circumstances, be rescued. 
We know now the product theorems of Weierstrass and Hadamard which formalize 
this idea. These results show that one only needs an estimate on the growth of the 
function in the region considered and one can avoid the more special properties of 
the two sides (addition theorem, differential equation) which have been used in other 
proofs. 

From Euler’s formula one can develop the theory of the Bernoulli numbers and 
polynomials. One has with these tools both what one needs to develop the Euler- 
Maclaurin summation formula and the Fourier expansion of the periodic extension 
of the Bernoulli polynomials from [0, 1] to R. One approach to this is due to Poisson 
(1823).4 

The second part of Riemann’s paper, which is a mere sketch, is concerned with 
the problem of determining the zeros of more precisely. He gives the asymptotic 
formula and indicates that it is possible to compute the zeros. He even goes further 
and suggests that “the majority” of the zeros lie on the critical line. At least the 
second of the two integral representations of the zeta function given in the paper is 
clearly part of this programme. We know from the Nachlass that Riemann did do 
some calculations.*> There is an oral tradition of a “black notebook” of Riemann’s 


“4Poisson’s argument, where he derives the Euler-Maclaurin formula from the Fourier expansion, 
is summarized by G.H. Hardy in his book Divergent Series, Oxford U.P. 1949, Sect. 13.8. Poisson’s 
original work is “Sur le calcul numérique des Intégrales définies”, Mémoires de L’Académie Royale 
des Sciences de l'Institut de France, Année 1823, T. VI, published 1827, 571-602. It is worth 
remarking that Riemann ignored some of Poisson’s real achievements on the theory of Fourier 
series in the introduction to his Habilitationsschrift. Poisson introduces the Poisson kernel, already 
an innovation, and uses it to demonstrate that for a continuous function the Fourier series is what 
Hardy calls summation (A) to the function. Alone, and with the help of Abel’s theorem (1826) 
which states that if )* a, converges then lim,y—+1.<1 >> @px" = )° dn, this theorem is sufficient 
for many application. Dirichlet’s theorem, which takes up the notion of a kernel function but one 
which is much more difficult, is a spectacular piece of analysis but its full force is rarely needed. 


45H.M. Edwards,Riemann’s Zeta Function, Academic Press, 1984, pp. 157-158, C.L. Siegel, [1, 
p. 792], W. Gabcke has been following this up further. 
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which may or may not still be extant. This tradition seems to come from C.L. 
Siegel and it is attributed to him in print by H.-J. Klemmt,*° but the material is 
extremely fragmentary. There is one page which has been discovered to date in the 
Nachlass about asymptotic formula. This has been examined by H.-J. Klemmt.*7 
According to this Riemann assumed that an asymptotic expansion existed, derived 
its consequences, and then compared coefficients. Also, according to Klemmt it 
seems as if Riemann was somewhat uncertain of his ground here. Since one knows 
how he approached questions in the theory one theta functions one might expect 
that he used a strategy comprising of the principle of the argument combined with 
the functional equation and the asymptotics of the gamma function which had been 
determined by Binet.*® One also needs an estimation of the argument of the zeta 
function close to and inside the critical line. The standard argument using Jensen’s 
theorem would have been within Riemann’s powers but unfortunately nothing has 
been found to show that Riemann had taken such an approach. Exactly on which 
considerations he based his statement about “most” of the zeros being on the line 
and his formulation of the Riemann Hypothesis remain very obscure. It seems very 
unlikely that his results went very far. As far as the “black notebook” is concerned, 
since it was at one time apparently in Landau’s possession if it had significant 
content one would have heard about it. 

The final part of Riemann’s paper is a very brief discussion of the relation 
between the explicit formula and the experimental evidence. Riemann claims— 
without any mention of the nature of the zeros—that the order of magnitude in the 
error is even O(x2). He writes,” 


46H.-J. Klemmt, Asymptotische Entwicklung fiir kanonische WeierstraBsche Produkte und Rie- 
manns tiberlegungen zur Nullstellenanzahl der Zetafunktion, Nachr. Akad. Wiss. Géttingen, II 
Math.-Phys. Kl. (1984), Nr. 4, 28-52, esp. p. 47. According to this source the notebook was a 
collection of formule. 


4"loc. cit. 

48See J. Binet, Mémoire sur les intégrales définies eulériennes, Journal de L’école Polytechnique, 
XVI (1839) 240pp. esp. pp. 138 and 144 where Binet’s integrals for log [(s) are given. 

“That is, The known approximating expression F(x) = Li(x) is therefore valid up to quantities of 
the order x? and gives somewhat too large a value; because the non-periodic terms in the expression 
for F(x) are, apart from quantities that do not grow infinite with x: 


Li(x) — ALi(x?) — ALi?) — ALi@x3) + ALi@es) — ALi?) + --- 


Indeed, in the comparison of Li(x) with the number of prime numbers less than x, undertaken 
by Gauss and Goldschmidt and carried through up to x = three million, this number has shown 
itself out to be, in the first hundred thousand, always less than Li(x); in fact the difference grows, 
with many fluctuations, gradually with x. But also the increase and decrease in the density of the 
primes from place to place that is dependent on the periodic terms has already excited attention, 
without, however, any law governing this behaviour having been observed. In any future count it 
would be interesting to keep track of the influence of the individual periodic terms in the expression 
for the density of the prime numbers. 

Trans. by D.R. Wilkins, Trinity College, Dublin, 1998. 
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Die bekannte Niherungsformel F(x) = Li(x) ist also nur bis auf Gréssen von der Ordnung 
x2 richtig und giebt einen etwas zu grossen Werth; denn die nicht periodischen Glieder 
in dem Ausdrucke von F(x) sind, von Gréssen, die mit x nicht in’s Unendliche wachsen, 
abgesehen: 


Li(x) — ELi(x?) — ALi(et) — ALi@x) + ALi@es) — LLi@@t) +++ 


In der That hat sich bei der von Gauss und Goldschmidt vorgenommenen und bis zu x = 
drei Millionen fortgesetzten Vergleichung von Li(x) mit der Anzahl der Primzahlen unter 
x diese Anzahl schon vom ersten Hunderttausend an stets kleiner als Li(x) ergeben, und 
zwar wachst die Differenz unter manchen Schwankungen allmahlich mit x. Aber auch die 
von den periodischen Gliedern abhingige stellenweise Verdichtung und Verdiinnung der 
Primzahlen hat schon bei den Zahlungen die Aufmerksamkeit erregt, ohne dass jedoch 
hierin eine Gesetzmassigkeit bemerkt worden ware. Bei einer etwaigen neuen Zahlung 
wiirde es interessant sein, den Einfluss der einzelnen in dem Ausdrucke fiir die Dichtigkeit 
der Primzahlen enthaltenen periodischen Glieder zu verfolgen. 


One can only conclude that Riemann was so convinced that the Riemann Hypothesis 
was true that he did not consider the alternative. A little earlier he does mention 
the other possibility but he seems to forget it. Moreover he seems to expect that 
his formula is genuinely exact and that it gives a complete answer to the question 
concerning the distribution of the primes. Calculations show that it is a remarkably 
good approximation, with the steps at the primes emerging from the continuous 
approximation but one knows, by a theorem of Littlewood of 1914, that this degree 
of approximation is not generally true.>° 

One other point in Riemann’s paper is interesting. He refers to the calculations 
of Gauss and Goldschmidt, here Benjamin Goldschmidt (1807-1851), Gauss’ 
amanuensis at the Sternwarte. The contribution of Goldschmidt does not seem to 
be recorded elsewhere. 

The date of Gauss’ letter to Encke can probably be understood that Gauss had 
learnt about the recent progress in the study of the primes and wanted to establish 
his priority without delay. Encke was “his man in Berlin” where Jacobi and Dirichlet 
were. The intention was presumably that Dirichlet and Jacobi would learn of Gauss’ 
claim. There is no evidence, as far as I know, as to whether this happened, nor do 
we know how far Gauss’ letter was known to others at that time. Gauss seems to 
have informed others independently. For example, Riemann’s reference in his paper 
to the calculations of Gauss and Goldschmidt, and to Gauss’ speculations, seems 
to indicate a different source as the letter to Encke does not mention Goldschmidt. 
It is, of course, quite possible that Riemann learnt of this work Goldschmidt and 
discussed the matter with him but the latter died in 1851.°! This would seem to be an 
indication that Riemann got his information directly from someone from the circle 
of Gauss; if it were not Goldschmidt, Stern, Listing, or Weber would be the most 


See A.E. Ingham, The Distribution of Prime Numbers, Cambridge U.P., 1932, p. 105. 

>!Tn fact Riemann hoped to taken on his position in the Sternwarte but Gauss apparently recognized 
that he was destined for higher goals—s. [1, p. 578]. Riemann attended a lecture course by 
Goldschmidt on terrestrial magnetism in 1846—-s. [1, p. 575]. 
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likely sources. Amongst these Stern was the one who had the strongest interest in 
number theory and Dedekind records that Riemann attended several lectures of his. 
It was Stern who befriended Gotthold Eisenstein after Eisenstein visited Gottingen 
in 1844 and to whom Eisenstein sent a number of very personal letters; Stern seems 
to have been a surrogate father to Eisenstein who seems to have broken with his 
own family at an early age—he seems to have had a relationship with his father 
not unlike that of Franz Kafka with his. It was also Stern who made possible the 
contact between Eisenstein and Gauss.” The positive impression one is left of Stern 
is confirmed by Rudio’s graveside oration at Stern’s funeral in Ziirich.™° 

It is not clear if the papers of Chebyshev would have been available to Riemann 
in journal form—they do not seem to have been in the library in Géttingen— 
but they were known to Dirichlet so we can assume that Riemann learnt of their 
content from him. There is no evidence that he had any direct contact with Gauss 
in connection with prime numbers but Gauss does seem to have been informed 
about him and seems to have followed his progress with interest. One may surmise 
that Riemann learnt of Gauss’ work on prime numbers partly through Dirichlet, 
and partly through intermediaries in Gottingen. The letter to Encke seems only to 
have become generally known after it was reprinted in Volume 2 of Gauss’ Werke 
in 1863. This volume appeared while Riemann was still alive, a few years after 
he wrote his paper on prime numbers. It seems to have been edited by Dedekind, 
Stern, and Schering; we know that both Dedekind and Schering were moderately 
close to Riemann and one may suspect that the same is true of Stern. All three 
were members of the Konigliche Akademie der Wissenschaften zu Gottingen as was 
Riemann,™* which was, as a body, responsible for editing and publishing Gauss’ 
Werke. It is striking that there is nothing in the “Bemerkungen” on Gauss’ letter 
either by Riemann or referring to his paper. That Riemann, although he was a 
member of the K6nigliche Akademie was not directly involved with this issue is 
presumably because he had suffered a collapse in his health in 1862 and was in Italy 
for a long period to recover. That there is no reference to Riemann’s paper is striking. 


6 The Zeta Function and Bessel Functions 


The calculation of the real zeros of the € function, the zeros of the zeta function on 
the critical line, was not a question that caused too much difficulty at that time, 
either in principle or in practice. It seems to have been known that the integral 
representation given by Riemann in the form 


52See G. Eisenstein, Mathematische Werke, Bd. 2, Chelsea, New York, 1975, 898-904. 


°3 See F. Rudio, Erinnerungen an M.A. Stern, Vierteljahrsschrift der Naturforschenden Gesellschaft 
in Ziirich, v. 39-40 (1894-1895) 131-143. Note that Rudio stresses the friendship between Stern 
and Henle. 


54He had been elected in 1859. 
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f(t) = sf ero) =! cos (5108) dx 


is closely analogous to integral representations of Bessel functions. This was 
made explicit by G. Pélya in 1926°> but there are implicit indications that it was 
recognized much earlier. Thus an expert on the calculation of Bessel functions, E. 
Meissel, made some rough calculations of the coefficients of the € function around 
t = O and could use this for rough, non-rigorous estimates of a few zeros. E. Meissel 
was an expert numerical mathematician who also constructed particularly accurate 
tables, published in 1888, of the Bessel functions Jo and J;. The zeros of these 
functions had appeared in a number of investigations before 1850. Riemann uses 
the Bessel functions Jp and Ko in his paper of 1855 on Nobili rings.°° 

The zeros of the Jo appear in the Fourier-Bessel expansion which was introduced 
by Fourier in his Théorie de chaleur (1822) and the zeros appear in several other 
investigations which were undertaken in the central part of the nineteenth century. 
Stimulated by Airy’s work on optics G.G. Stokes developed a particularly effective 
method of computing the zeros of various Bessel functions (1850). 

Although Riemann does not quote Stokes*’ he makes use, in his paper on Nobili 
rings, of “halbconvergenten Reihen’*® which are those established by Stokes. It 
seems then quite probable that Riemann knew Stokes’ paper. We know that the 
volume of Transactions of the Cambridge Philosophical Society where it appeared 
was available in Gottingen; I am indebted to J. Mangei and H. Rohlfing of 
the Handschriftenabteilung/Historische Bibliothek of the SUB Gottingen for this 
information. Moreover Riemann indicates familiarity with other papers of Stokes in 
connection with the theory of shock waves.°° A transformation that Riemann applies 
in his paper on Nobili rings, pp. 89-90, and commented by the editors, is the same 
as one used by Stokes. Stokes does not use the methods of function theory and so his 
proof is in the context of real variables but is on the threshold of complex function 
theory. Stokes uses it in his paper to convert an integral introduced in 1838 by the 
Astronomer Royal, G.B. Airy in the course of certain optical investigations 


_ cos (Fo - xw)) dw 
0 


into a rapidly convergent integral 


5G, Pélya, Bemerkung iiber die Integraldarstellung der Riemannschen €-Funktion, Acta Math. 48 
(1926) 305-317. 


See [1, pp. 87-98]. 


57G.G. Stokes, On the numerical calculation of a class of definite integrals and infinite series, in 
Collected Works, Vol. 2, 329-357. 


8See [1, p. 91]. 
TL, p. 208]. 
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sf eTi/6 exp(—w? + xue?™'/) 4 oe Ti/6 exp(—w? + xue?™/3)dy 
0 


from which he can determine the differential equation which it satisfies. He uses this 
first of all to find a differential equation, not too different from the one that Airy’s 
function satisfies and from this, by iterating a geometrical comparison method finds 
a “semi-convergent” series which gives a very good approximation for the Airy 
function and which also gives a very effective method for computing the zeros. He 
emphasizes just how much easier his method is than Airy’s and gives tables. Stokes’ 
method also shows, in a later notation, the connection between Airy’s function and 
Bessel functions of order ; (theorem of Wirtinger (1897) and Nicholson(1909)). 
Stokes gives examples of further examples of his method applied to other Bessel 
functions. Also in this paper Stokes makes use, probably for the first time, of the 
principle of stationary phase.°° This is usually attributed to W. Thomson in a paper 
of 1887.°! 

There is a broad discussion of the history of the calculation of the zeros of Bessel 
functions in G.N.Watson, A Treatise on the Theory of Bessel functions, Ch.VIL. 
Oddly in this discussion Watson does not discuss Stokes’ method. Most of the 
methods he does look at use one or another integral representation. Watson does 
mention Stokes integral representation of the Airy function in Sect. 6.4 (from which 
Stokes derived the differential equation) and Stokes’ computation of the zeros of the 
Airy function and of Jo and J; (Sects. 15.52 and 15.53). 

We have indicated above that the zeros of the Bessel function Jo appear in the 
Fourier-Bessel expansion.© This is an expansion similar in form to Riemann’s 
explicit formula. What is particularly interesting here is that the analogue of the Rie- 
mann Hypothesis is actually valid as was fist proved by Fourier himself. Fourier’s 
proof was reworked by Hurwitz in 1889.°° There is a second proof, attributed by 
Watson to Lommel but which was also, apparently independently, discovered 
by Riemann and published at about the same time as Lommel’s proof by K. 
Hattendorf in his version of Riemann’s lectures on the partial differential equations 
of mathematical physics.® Fourier’s proof involves rational approximations to Jo, 
derived from the differential equation, to which one can apply the methods for 
studying the zeros of polynomials; the Lommel-Riemann proof involves an integral 
equation which is also derived from the differential equation and one might, with 


Joc. cit. p. 340. It is perhaps worth noting that Stokes’ idea was an extension, to an oscillating 
integral, of a method used by Laplace (1820) for non-oscillating integrals which have an 
exponential kernel; see E.T.Copson, Asymptotic Expansions, Cambridge U.P. 1965, Sect. 17. 


61 See the remarks in H. Lamb, Hydrodynamics, Dover, 1945, p. 395. 
©2.See G.N. Watson, loc. cit., Chap. XVIIL. 

63For the details see Watson, loc. cit., Sect. 15.27. 

4See Watson, loc. cit., Sect. 15.25. 


See K.Hattendorf, Partielle Differentialgleichungen und deren Anwendung auf physicalische 
Fragen. Vorlesungen von Bernhard Riemann, Braunschweig, 1869, Sect. 97. 
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some license, see it as a precursor of a proof in the realm of spectral theory in Hilbert 
space. The zeros of Jo are very regular in a way that the zeros of the Riemann zeta 
function are not. In particular Gram’s law describing the distribution of the zeros of 
the zeta function fails badly. This seems to rule out any approach to the Riemann 
Hypothesis along the lines of the Fourier proof; whether Hilbert’s expectation that 
a spectral approach could be successful cannot be ruled out but over a century of 
failed attempts counsels caution. 

In view of these considerations one can view the theory of the € function as 
analogous that part of the theory of Bessel functions that one can establish without 
the use of the differential equation (or some related property). The first step in 
Stokes’ paper of using, in effect, Cauchy’s theorem and the step using the method 
of stationary phase belong to this category. Riemann goes much further—Wolfgang 
Gabcke has informed me that Riemann actually uses the Sattelpunkt method in 
his notes. It was only much later that this became a standard technique and it is 
usually ascribed to P. Debye. However, it developed during the second half of the 
nineteenth century and Debye gives historical details. The idea, as he points out, 
began with a fragment of Riemann’s, written in Italian, dated October, 1863 and 
edited by H.A. Schwarz.°’ 


7 Changes in Our Understanding of Riemann 


In recent times there have been new documents published which help to deepen 
our understanding of Riemann’s thinking generally. In particular the Dedekind- 
Weber correspondence® throws a great deal of light on Riemann’s life and his 
associations with those around him. In particular his wife Elise (née Koch) appears 
in a much more active and supportive role than had been visible before. Besides this 
correspondence some letters to Wilhelm Weber were discovered recently by J.M. 
Marin, and published by W. Gabcke.® In particular a letter of 18th April, 1866, that 
is just 3 months before his death, show that Riemann was actively involved with 
the theory and physiology of the ear, more so than the fragment published in the 
Collected Papers’ would suggest. He was dissecting the ears of slaughtered animals 


66P Debye, Niherungsformel fiir die Zylinderfunktionen fiir groBe Werte des Arguments und 
unbeschrankt veranderliche Werte des Index, Math.Ann. 67 (1909) 535-558. 

6711, pp. 456-462]. For more information see E.B. Barnes, The Asymptotic Expansion of Integral 
Functions defined by Taylor’s Series, Phil. Trans. Royal Soc. London. Series A, Vol. 206 (1906), 
pp. 249-297, and A Memoir on Integral Functions, Phil. Trans. Royal Soc. London. Series A, Vol. 
199 (1902), pp. 411-500. 

68Katrin Scheel, Der Briefwechsel Dedekind-Weber, de Gruyter, 2014. 

6W. Gabcke, Transcriptions of six letters of Bernhard Riemann preserved in the SIL, 
arXiv:1602.08264 (2016). 

711, p. 370ff.]. In a footnote by E. Schering and J. Henle it is remarked that Helmholtz’ work was 
a stimulus for Riemann’s interest. Jakob Henle (1809-1895), probably not a name known to most 


284 S.J. Patterson 


in order to develop a mathematical theory of the ear. Perhaps he was inspired to 
this endeavour by Listing’s work on physiological optics. That he was involved 
in such very physical investigations opens questions as to exactly what state his 
health was in. His final journey to Selasca was apparently a flight from the war that 
was breaking out between Prussia and Hannover; the situation in Gottingen was 
corresponding chaotic.’' Dedekind in the Lebenslauf indicates very discretely that 
the journey was very strenuous. Both Archimedes and Kepler died through being 
caught up in wars and it seems as if the same may well be true of Riemann. This is 
a sobering thought at times like this when so many are in similar situations. 

In the article by H. Freudenthal quoted above that author puts forward the view 
that Riemann would have tended towards philosophy had he lived longer. These 
more recent finds make this unlikely if one understands “philosophy” in the sense 
of someone like Kant; if one understands the word in an old-fashioned sense as 
“Natural Philosophy”, then the remark would be justified. 

It is remarkable that although Riemann had the greater number of contacts with 
physicists, mathematical and otherwise, the works for which he is most remembered 
belong to what we would consider today to be pure mathematics. It is worth 
comparing Riemann with two of his near contemporaries G.G. Stokes (1819-1903) 
and W. Thomson (1824-1907) both of whom were very interested, and contributed 
substantially to, recent mathematical developments of their time, but in later years 
worked on very broad ranges of subjects. Hermann v. Helmholtz (1821-1894) had a 
similarly broad range on interests; in his case his mathematical interests developed 
rather later and in fact took up the question as to the nature of geometry about which 
Riemann had thought deeply. The indications are that Riemann would have been a 
kindred spirit of these scholars—but it is perhaps better not to speculate too far. It is, 
though, worthy of note that according to a footnote” by E. Schering and J. Henle, an 
anatomist in Géttingen, Riemann had been inspired to study the ear by reading the 
works of Helmholtz. It is perhaps worth recalling that it is recorded that “Riemann 
sang like a canary”.”° 

In quite recent times documents of Riemann’s and those with some sort of 
connection with him have been found and shed new light on Riemann’s life and 
work; one may assume that there are more that still many more unregarded in 
various archives. One knows little about Riemann’s life, for example, if Klein’s 


mathematicians, was a physiologist who worked from 1852 in Gottingen. He worked on many 
topics and was especially a gifted microscopist who studied the cochlea. Riemann was interested 
in studying the cochlea of slaughtered animals but in the letter in question explains that he had 
great difficulty in extracting the delicate cochlea from the bone in which it is embedded. For the 
historical context of Riemann’s letter see the second footnote on p. 145 of H. Helmholtz, On the 
sensations of tone, Dover, 1954. It is perhaps of interest to note that it was Henle to whom Dedekind 
submitted his draft of the Lebenslauf for comments; see. K. Scheel loc. cit., p. 112. 

71See K. Wettig, Als die Hannoveraner Preufen wurden, Géttinger Tageblatt, 20. Juni 2016, p. 16. 
PUL, p. 370]; see also [1, pp. 843-844]. 

®This was reported to (the notoriously unmusical) F. Klein by M. Stern—see F. Klein, Vorlesungen 
iiber die Entwicklung der Mathematik im 19. Jahrhundert, Bd. I, p. 249. 
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remark does refer to aspects of his life in G6éttingen in the middle 1850s which 
are not yet in the literature. Klein’s remark suggests that it would illuminate our 
understanding of Riemann’s development. Riemann does seem to have been much 
happier in Italy. We know very little about his relationships to Wilhelm Weber and 
Johann Listing. Of particular interest in this would be the papers of J.B. Listing. 
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A Taniyama Product for the Riemann Zeta 
Function 


David E. Rohrlich 


Abstract A Taniyama product for the Riemann zeta function is defined and an 
analogue of Mertens’ theorem is proved. 


1 Introduction 


Tucked unobtrusively into Taniyama’s memoir [4] on compatible families of €-adic 
representations is a curious identity expressing the zeta function of such a family as 
an infinite product of imprimitive Artin L-functions [4, p. 356, Theorem 3]. The 
simplest case of the identity (arising from the cyclotomic character, or from its 
inverse, depending on one’s conventions) is 


C(s—1)/t(s) = | | é(s). (1) 


c21 


where ¢(s) is the Riemann zeta function and ¢.(s) is the imprimitive Dedekind 
zeta function—imprimitive because the Euler factors at the primes dividing c are 
removed—of the cyclotomic field K, generated over Q by the cth roots of unity. 
Thus ¢)(s) is €(s) itself, €2(s) is (1 — 2~*)€(s), and so on. The infinite product 
converges for ‘i(s) > 2. 

In this note we modify (1) slightly so as to obtain a product for ¢(s) rather than 
€(s — 1)/€(s). Let K* be the maximal totally real subfield of K,, and let €*(s) be 
the Dedekind zeta function of K* with the Euler factors at the primes dividing c 
removed. Then 


cs) =[[ttGt+b. (2) 


c21 
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Like the traditional Euler product 


és) =[[a-p°y, (3) 
Pp 


the Taniyama product (2) converges for #(s) > 1. 

The main result of this note can be viewed as an analogue of Mertens’ theorem 
[2]. It bears the same relation to (2) as Mertens’ theorem does to (3), and Mertens’ 
theorem itself figures prominently in the proof. Let y denote the Euler-Mascheroni 
constant. 


Theorem 1 [].<, €*(2) ~ e” logx. 


Of course €* (2) can be computed explicitly in terms of generalized Bernoulli 
numbers. For a primitive Dirichlet character y of conductor gq, let 


= 0x00 /q-i+ 4/6). 


j=l 


Also write d+ for the discriminant of K+, and let y(c) be the cardinality of (Z/cZ)*. 
For the sake of a succinct formula we put 


g(c) ifc>3 
p(c) = 
2 ifc = lor2. 


Then 


co = 51 I bala? (4) 


qlc x mod q 
x(-)=1 


where the asterisk indicates that in the second product, y runs over primitive 
characters of conductor q. 

Expressions similar to (4) have arisen in other contexts. For example, nearly the 
same triple product occurs in a formula of Yu [6] for the order of a certain cuspidal 
divisor class group of the modular curve X;(N) (see also Yang [5, p. 521]). Even 
so, the differences between Yu’s formula and (4) appear to be significant enough to 
preclude a straightforward interpretation of Theorem | as an asymptotic average of 
cuspidal divisor class numbers. 

It is a pleasure to thank the referee for a careful reading of the manuscript. 


Taniyama Product 289 
2 Taniyama’s Identity 


A proof of (1) is of course subsumed in Taniyama’s proof of his general formula, 
but we will nonetheless sketch a proof here before going on to the modification (2). 
For a prime p not dividing c let f(p,c) be the order of the residue class of p in 
(Z/cZ)*. Also, write Z(s) for the right-hand side of (1). Then Z(s) can be written 
as the double product 


Zis)= I] [fa — pF) POMP), (5) 


c21 pte 


where the inner product is ¢.(s) and runs over primes not dividing c. The proof 
of (1) amounts to reversing the order of multiplication in the double product in (5). 
By choosing a branch of log Z(s) we can do the computation additively, and the 
absolute convergence of the resulting triple sum in the right half-plane R(s) > 2 
will show a posteriori that the original double product is meaningful in this region 
and that the calculation is legitimate. 

We define our branch of log Z(s) by 


—mf (p.c)s 
logZ(s) = )> > oy? 2 (6) 


c21 pte m21 


Putting d = mf(p,c) and summing over d = 1, we obtain 


log Z(s) = > ‘2 a #0 a 


c21 d21 pte 
f(p.c)|d 


or equivalently (since f(p, c)|d if and only if p4 = 1 mod c) 


log Z(s) = b> + PO re. 


p d2l c2l 
clp?—-1 


As ee y(m) = n, we conclude that 


log Z(s) = )° Y\ d7!p-*(p4 - 1). 


p d2\1 


The inner sum equals log((1 — p!~*)~!(1 — p~)), and (1) follows. 
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3. The Modification 


The proof of (2) is much the same. Put gt (c) = [K* : Ql], so that g*(c) is 1 if 
c = | or2 and ¢(c)/2 otherwise. (Note also that y* (c) = $(c)/2, where (c) is as 
in the introduction.) If p is a prime not dividing c, then the order in Gal(K* /Q) of 
a Frobenius at p will be denoted f* (p, c). Of course f* (p, c) is also the order of the 
coset represented by p in the quotient of (Z/cZ)* by the image of {+1}. Let Z*(s) 
denote the right-hand side of (2), and write Zt (s) as a double product: 


sft cS peas (ce + Cc 
z*(s) =] ]][ Ja-p f(D, ) eT OMT (0) | (7) 


c21 pte 
Define a branch of log Z* (s) by setting 
—mf* (p.c)s 


+ 
logZ*(s) => on 7 (8) 


c21 pte m21 


The calculation will again show that the triple sum is absolutely convergent for 
R(s) > 2. Putting d = mf*(p,c) and summing over d > 1, we find 


of gt (c) —ds 
ezo=-rye £4, 
c21 d21 pte 
fT (pela 
as before. But the condition f* (p, c)|d means c|p“ — 1 or c|p“ + 1, so we get 


+ 
bezt= Dy Eye. ) 


Pp d21 ¢l(pt+l) 


We emphasize that the innermost sum is the sum over all c such that at least one of 
the conditions c|p’ — 1 and c|p4 + 1 is satisfied. 

If c > 3, then the conditions c|p4 — 1 and c|p4 + 1 are mutually exclusive and 
gt (c) = g(c)/2, so we have 


1 1 
UV POs MV PO+5 De vO. (10) 
cZB3 ce3 cB3 
clpt#1 clp?-1 clp!+1 


On the other hand, if c = 1 or 2, then the conditions c|p“ — 1 and c|p4 + 1 are both 
satisfied (for if c = 2, then p is odd), but gt (c) = 1. So Eq. (10) is correct without 
the restriction c > 3, and the identity adi y(m) = n gives 


1 
DY vO =5(0'-)+O"+1) =p" (1) 
cl(pt£l) 
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Multiplying through by p~“’/d in (11) and inserting the result in (9), we obtain 


logZ*(s) =} log(1 =p); 
P 


or in other words 


[[&é@ =o6-0 (12) 


c21 


for K(s) > 2. Replacing s by s + 1 gives (2) for R(s) > 1. 


4 The Analogue of Mertens’ Theorem 


We shall prove that 


LT] cf @ ~ e” logx. (13) 


cXxt+1 


Theorem | is an immediate consequence of (13), because log(x — 1) ~ log x. 

We proceed as in the derivation of (2), but with two crucial changes: first, we 
take s = 2, and second, c now runs over the finite set of positive integers < x + 1. 
Thus (8) is replaced by 


+ —2mft (p,c) 
og T] C@= > ee a — (14) 


cXxt+1 cXx+1 pte 


Next we make the change of variables d = mf* (p,c). Since c runs over a finite set 
and the Dirichlet series for log &* (s) is absolutely convergent for i(s) > 1 and in 
particular for s = 2, we can rearrange the order of summation to obtain 


a 


lg [] F@=) DD CO|Z (15) 


cSxt+l1 Pp d2lcXx+l 
clpt+1 


as in (9). For the sake of notational simplicity, we conflate the double sum over p 
and d into a single sum over p“, and we put 


2d 
px) =—— LY ot. (16) 
ci 
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Then (15) can be written in the form 


log [[ SO=)) 904.94) F0.9+ D> O04. (17) 
cSt pt>x p>x pt<x 
d>2 


We shall prove the following assertions: 


Y> O(p7,x) = o(1). (18) 
pl>x 
d>2 
> P(p,x) = o(\). (19) 
p>x 
d P4.x) = log] [1 — po!) + of). (20) 
pl<x psx 


Granting these statements and using them in (17), we find that 


log [] ¢*(2) = log] [a —p')! +001), 


cXSx+1 pr 


whence exponentiation and an appeal to Mertens’ theorem give (13). 
To prove (18), we first note that 


Ys et © <p". (21) 


by (11). Thus &(p", x) < pra by (16), whence the left-hand side of (18) is 
bounded by the sum of the terms with p’ > x in the convergent double series 
a ye p “/d. Since the tail of a convergent series is o(1), we obtain (18). 

Next we prove (19). Take x > 20. It suffices to show that the sums 


De = De P(p, x) 
x<pS<x log x 


and 


= a > P(p, x) 


p>xlogx 


are both o(1). Appealing once again to (21) and (16), we see that 


DS DE Pe! = log(log(xlogx)/(ogx)) + o(1) 


x<p&x log x 
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(cf. Chebyshev [1]). But log(og(x log x)/(log x)) = log(1 + 0(1)), which is o(1). 
Thus the sum )-, is o(1). 
For )°, we revert to an earlier order of summation: 


Y= Uw woO Yep (22) 


cXxt+1 p=+1 modc 
p>xlogx 


We then rewrite the inner sum using Abel summation: 


a c,#1 CO Tn Bren +] 
b2 p= noes) |" +2 f wea (23) 
p=+1 modc xlogx xlogx y 
p>xlogx 


where z(y;c, +1) is the number of primes < y congruent to +1 mod c. By the 
strong form of the Brun—Titchmarsh theorem due to Montgomery and Vaughan [3], 
we have 

dy 


Using (24) in (23) and then inserting the result in (22), we see that 


yoy 7 a a 
2 xlogx y? log(y/c) 


ce<xt+1 


the term —z(xlogx;c,+1)/(xlogx)? having simply been omitted since it is 
negative. For x > 20 we have (xlogx)/c > e; hence the integrand is <y~* and 
the integral is <(x log x)~!. It follows that the sum over c is <(1 + 1/x)/(log x) and 
thus o(1). 

Finally we prove (20). The summation on the left-hand side of (20) is restricted 
to p’ < x, so if clp’ + 1, thenc < x + 1. Hence (p“, x) coincides with p~¢/d 
by (11) and (16), so 


oe o=) 9 a 


pi<x pesx 


We may write this identity as 


Y> op. = > YO p“/d, (25) 


pl<x PSX qe loss 
™logp 
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while 


log] [d-p')' =>0 op 4a. (26) 


pr px d21 


Subtracting (25) from (26), we see that 


log] [a -p7')'- 9) 04.9) = 0p“ /d (27) 


pSx pls<x PSX pt>x 


If p < xand p4 > x, then d = 2, so (27) gives 


log] [0-—p 7)! - )) 04.9) < Yop “Va. 
pw pl<x pt>x 


d22 


The left-hand side is positive by (27), and as noted previously, the right-hand side is 
the tail of a convergent double series, and therefore o(1). Hence the left-hand side 
is o(1), and (20) follows. 


5 The Special Value 


For the sake of completeness, we recall the standard calculation of ¢* (2) in terms 
of generalized Bernoulli numbers. Write ¢+ (s) as a product of Dirichlet L-functions 
associated with even Dirichlet characters to the modulus c: 


o= [| tex. (28) 
yolei 


We restrict attention to primitive characters by writing 
ciy=T] [] 26.0] [G-x@p. (29) 


qlc x mod q ple 
xC-))=1 


Now recall the functional equation of L(s, y): For x even and primitive of conductor 
q, let 


As, x) = FP a? T(8/2)L(s, 1); (30) 
then 


A(s, x) = WV)ACT —s,X), (31) 
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where W(x) is the root number of y. On the other hand, according to a classic 
formula we have L(1 — k, x) = —bx,,/k for integers k > 2 (and actually even for 
k = lif y # 1). Taking k = 2 and applying (31), we obtain 


L(2, x) = 77bngW()/9"?. (32) 
Next recall that if y has order > 3 then W(y)W(Z) = 1, while if y7 = 1, then 
W(x) = 1. Thus on substituting (32) in (29), we obtain 


* 


C2) =T]@/e 2" TT &,T]a-xop. (33) 
gle x mod q ple 
4x )=1 


where w*(q) is the number of even Dirichlet characters which are primitive of 
conductor q. Since )°,,,. ¥*(q) = y* (c) we have 


[ore = 70 (34) 
gle 
Furthermore 
Ile? = g, (35) 
qle 


as one sees, for example, by observing that the exponential factor in the functional 
equation of the Dedekind zeta function of K* is (d*)’/*, while the exponential 
factor in (31) or rather (30) is q'! 2- On substituting (34) and (35) in (33), we 
obtain (4). 


6 A Question 


For c > 3, let & be the quadratic Hecke character of K* associated with the 
extension K./K7, and let L(s, &) be the corresponding Hecke L-function. Write 
L.(s) for the imprimitive Hecke L-function obtained from L(s, &) by deleting the 
Euler factors at the primes dividing c. Also put L,(s) = 1 forc = 1 or 2. Then 
L-(s) = €-(s)/€F (s) in all cases. Hence combining (1) with (12), we obtain 


ty) =|] L). 


c21 


The infinite product converges for R(s) > 2, but 1/f(s) is holomorphic and 
nonvanishing for ‘¥(s) > 1. Is the true region of convergence perhaps much larger 
than {¢(s) > 2? We can offer only a minimal enlargement: 
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Theorem 2 The product ||, Lc(s) converges to ¢ (s)! for R(s) > 2. 


Proof For integers c > 3 and primes p } c, put k(p,c) = &-(p), where p is a prime 
ideal of K* lying above p. If c = 1 or 2 put «(p,c) = 0. Then 


L-(s) = [J — «pcp 98) 97 09 0.0 
pte 
for H(s) > 1, and consequently 


of . 
log I] L-(s) = S: LES Deon 


cXxt+1 cXxt+1 pte 


Making the change of variables d = mf* (p,c) as before, we obtain 


—ds 
og [] 20)=) OD ot (c)K(p, cy 9 F— 


c<xt+1 Pp d21\cXx+l 
clpt+1 


(note that the condition c|p¢ + 1 means precisely that f+ (p, c)|d). All of this is valid 


for {i(s) > 1, but we now assume that {i(s) > 2 or simply that N(s) = 2, since the 
case t(s) > 2 has already been dealt with. Put 


—ds 
Yipt.x.s) ="— DO ot upc 09. (36) 


Comparing (36) with (16), we see that 
|Y(p%, x, 8)| < P(p",x). (37) 


This relation will largely reduce the proof to our previous estimates. Indeed, as 
in (17), we can write 


log I] L.(s) = pa W(p",x,s) + > W(p,x,5) + > W(p",x, 8), 
cSx+1 pl>x p>x pl<x 
d22 


and the first and second sums on the right-hand side are o(1) by (18), (19), and (37). 
Thus to prove the theorem it suffices to show that 


S> W(p*.x, 8) = log f(s)! + o(]). (38) 


pt<x 


The argument will be similar to the argument for (20). 
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The first point is that the condition p? < x in (38) renders the condition c < x +1 
superfluous in (36). Thus in the context of (38) we have 


w if a/f* pc), 39 
(p".4,8) = PD eto) (39) 
clp?+1 
Now suppose that c|p’ + 1 (in other words that f+ (p,c)|d) and that c > 3. The 
conditions p’ = —1 mod c and K(p, Vit eo = —I1 are equivalent, because 
both are equivalent to the assertion that f(p,c) = 2ft(p,c) and d/f*(p,c) is 
odd. It follows that the complementary conditions, namely p’ = 1 mod c and 

k(p,c)/f* 9) = 1, are also equivalent, whence 
+6 I 
DY POxe.j"eo => D7 90-5 >» g(c). (40) 
ce3 c2=3 c23 
clpt+1 * es, 


As before, the restriction c > 3 can be eliminated throughout (40), because if c = 1 
or 2, then k(p,c) = 0. 
With the restriction c > 3 removed, (40) implies that 


Y et Oxe.ol9 = Sp4-)- t+) ==1., 


clpt+1 


and therefore (39) gives 


—ds 
LYotx)=-L yO a. (41) 


d< < logx 
pl<x PX 7c log 
ps ds logp 


On the other hand, 


log f(s)! =- 0 pr (42) 


p dz2l 
Taking the absolute value of the difference of (41) and (42), we find 
d - pe 
44.49) —S0)"'] < 
ptsx pl>x 


Since t(s) => 2 the right-hand side is the tail of a convergent series (namely the 
Dirichlet series for log €(s)) and is therefore o(1). Thus (38) follows and the proof 
of the theorem is complete. 
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